Preface


This book gives an elementary account of thermal physics. The subject is 
simple, the methods are powerful, and the results have broad applications. 
Probably no other physical theory is used more widely throughout science and 
engineering. 

We have written for undergraduate students of physics and astronomy, and 
for electrical engineering students generally. These fields for our purposes have 
strong common bonds, most notably a concern with Fermi gases, whether in 
semiconductors, metals, stars, or nuclei. We develop methods (not original, but 
not easily accessible elsewhere) that are well suited to these fields. We wrote 
the book in the first place because we were delighted by the clarity of the 
“new” methods as compared to those we were taught when we were students 
ourselves. 

The second edition is substantially rewritten and revised from the first 
edition, which, although warmly accepted, suffered from the concentration 
of abstract ideas at the beginning. In the new structure the free energy, the 
partition function, and the Planck distribution are developed before the chem- 
ical potential. Real problems can now be solved much earlier. We have 
added chapters on applications to semiconductors, binary mixtures, transport 
theory, cryogenics, and propagation. The treatment of heat and work is new 
and will be helpful to those concerned with energy conversion processes. Many 
more examples and problems are given, but we have not introduced problems 
where they do not contribute to the main line of advance. For this edition 
an instructor's guide is available from the publisher, upon request from the 
instructor. 

This edition has been tested extensively over the past few years in classroom 
use. We have not emphasized several traditional topics, some because they are 
no longer useful and some because their reliance on classical statistical me- 
chanics would make the course more difficult than we believe a first course 
should be. Also, we have avoided the use of combinatorial methods where they 
are unnecessary. 

For a one quarter course for physics undergraduates, we suggest most of 
Chapters 1 through 10, plus 14. The Debye theory could be omitted from
Chapter 4 and the Boltzmann transport equation from Chapter 14. For a one
quarter course for electrical engineers, we suggest Chapter 13 at any time after 
the discussion of the Fermi gas in Chapter 7. The material of Chapter 13 does 
not draw on Chapter 4. The scope of the book is ample for a one semester 
course, and here the pace can be relaxed. 

Notation and units: We generally use the SI and CGS systems in parallel. 
We do not use the calorie. The kelvin temperature $T$ is related to the fundamental
temperature $\tau$ by $\tau = k_B T$, and the conventional entropy $S$ is related
to the fundamental entropy $\sigma$ by $S = k_B \sigma$. The symbol log will denote natural
logarithm throughout, simply because ln is less expressive when set in type.
The notation (18) refers to Equation (18) of the current chapter, but (3.18) 
refers to Equation (18) of Chapter 3. 

The book is the successor to course notes developed with the assistance of 
grants by the University of California. Edward M. Purcell contributed many 
ideas to the first edition. We benefited from review of the second edition by 
Seymour Geller, Paul L. Richards, and Nicholas Wheeler. Help was given by 
Ibrahim Adawi, Bernard Black, G. Domokos, Margaret Geller, Cameron 
Hayne, K. A. Jackson, S. Justi, Peter Kittel, Richard Kittler, Martin J. Klein, 
Ellen Leverenz, Bruce H. J. McKellar, F. E. O'Meara, Norman E. Phillips, 
B. Roswell Russell, T. M. Sanders, B. Stoeckly, John Verhoogen, John Wheatley, 
and Eyvind Wichmann. We thank Carol Tung for the typed manuscript and 
Sari Wilde for her help with the index. 


Berkeley and Santa Barbara Charles Kittel 
Herbert Kroemer 


Note to the Student 


For minimum coverage of the concepts presented in each chapter, the authors 
recommend the following exercises. Chapter 2: 1, 2, 3; Chapter 3: 1, 2, 3, 4, 8,
11; Chapter 4: 1, 2, 4, 5, 6, 8; Chapter 5: 1, 3, 4, 6, 8; Chapter 6: 1, 2, 3, 6, 12,
14, 15; Chapter 7: 2, 3, 5, 6, 7, 11; Chapter 8: 1, 2, 3, 5, 6, 7; Chapter 9: 1, 2, 3;
Chapter 10: 1, 2, 3; Chapter 11: 1, 2, 3; Chapter 12: 3, 4, 5; Chapter 13: 1, 2,
3, 7, 8, 10; Chapter 14: 1, 3, 4, 5; Chapter 15: 2, 3, 4, 6.
Thermal Physics


Introduction 


Our approach to thermal physics differs from the tradition followed in beginning 
physics courses. Therefore we provide this introduction to set out what we are
going to do in the chapters that follow. We show the main lines of the logical 
structure: in this subject all the physics comes from the logic. In order of their 
appearance, the leading characters in our story are the entropy, the temperature, 
the Boltzmann factor, the chemical potential, the Gibbs factor, and the distribu- 
tion functions. 

The entropy measures the number of quantum states accessible to a system. 
A closed system might be in any of these quantum states and (we assume) with 
equal probability. The fundamental statistical element, the fundamental logical 
assumption, is that quantum states are either accessible or inaccessible to the 
system, and the system is equally likely to be in any one accessible state as in 
any other accessible state. Given $g$ accessible states, the entropy is defined as
$\sigma = \log g$. The entropy thus defined will be a function of the energy $U$, the
number of particles $N$, and the volume $V$ of the system, because these parameters
enter the determination of $g$; other parameters may enter as well. The
use of the logarithm is a mathematical convenience: it is easier to write $10^{20}$
than $\exp(10^{20})$, and it is more natural for two systems to speak of $\sigma_1 + \sigma_2$ than
of $g_1 g_2$.

When two systems, each of specified energy, are brought into thermal contact 
they may transfer energy; their total energy remains constant, but the constraints 
on their individual energies are lifted. A transfer of energy in one direction, or 
perhaps in the other, may increase the product $g_1 g_2$ that measures the number of
accessible states of the combined systems. The fundamental assumption biases 
the outcome in favor of that allocation of the total energy that maximizes the 
number of accessible states: more is better, and more likely. This statement is 
the kernel of the law of increase of entropy, which is the general expression of 
the second law of thermodynamics. 

We have brought two systems into thermal contact so that they may transfer 
energy. What is the most probable outcome of the encounter? One system will 
gain energy at the expense of the other, and meanwhile the total entropy of the 
two systems will increase. Eventually the entropy will reach a maximum for 
the given total energy. It is not difficult to show (Chapter 2) that the maximum
is attained when the value of $(\partial\sigma/\partial U)_{N,V}$ for one system is equal to the value of
the same quantity for the second system. This equality property for two systems
in thermal contact is just the property we expect of the temperature. Accordingly,
we define the fundamental temperature $\tau$ by the relation

$$\frac{1}{\tau} \equiv \left(\frac{\partial\sigma}{\partial U}\right)_{N,V}. \tag{1}$$

The use of $1/\tau$ assures that energy will flow from high $\tau$ to low $\tau$; no more
complicated relation is needed. It will follow that the Kelvin temperature $T$ is
directly proportional to $\tau$, with $\tau = k_B T$, where $k_B$ is the Boltzmann constant.
The conventional entropy $S$ is given by $S = k_B \sigma$.

Now consider a very simple example of the Boltzmann factor treated in 
Chapter 3. Let a small system with only two states, one at energy 0 and one at
energy $\varepsilon$, be placed in thermal contact with a large system that we call the
reservoir. The total energy of the combined systems is $U_0$; when the small
system is in the state of energy 0, the reservoir has energy $U_0$ and will have
$g(U_0)$ states accessible to it. When the small system is in the state of energy $\varepsilon$, the
reservoir will have energy $U_0 - \varepsilon$ and will have $g(U_0 - \varepsilon)$ states accessible to
it. By the fundamental assumption, the ratio of the probability of finding the
small system with energy $\varepsilon$ to the probability of finding it with energy 0 is

$$\frac{P(\varepsilon)}{P(0)} = \frac{g(U_0 - \varepsilon)}{g(U_0)} = \frac{\exp[\sigma(U_0 - \varepsilon)]}{\exp[\sigma(U_0)]}. \tag{2}$$

The reservoir entropy $\sigma$ may be expanded in a Taylor series:

$$\sigma(U_0 - \varepsilon) \simeq \sigma(U_0) - \varepsilon(\partial\sigma/\partial U_0) = \sigma(U_0) - \varepsilon/\tau, \tag{3}$$

by the definition (1) of the temperature. Higher order terms in the expansion
may be dropped. Cancellation of the term $\exp[\sigma(U_0)]$, which occurs in the
numerator and denominator of (2) after the substitution of (3), leaves us with

$$P(\varepsilon)/P(0) = \exp(-\varepsilon/\tau). \tag{4}$$


This is Boltzmann’s result. To show its use, we calculate the thermal average
energy $\langle\varepsilon\rangle$ of the two state system in thermal contact with a reservoir at
temperature $\tau$:

$$\langle\varepsilon\rangle = \sum_i \varepsilon_i P(\varepsilon_i) = 0\cdot P(0) + \varepsilon P(\varepsilon) = \frac{\varepsilon\exp(-\varepsilon/\tau)}{1 + \exp(-\varepsilon/\tau)}, \tag{5}$$

where we have imposed the normalization condition on the sum of the
probabilities:

$$P(0) + P(\varepsilon) = 1. \tag{6}$$

The argument can be generalized immediately to find the average energy of a
harmonic oscillator at temperature $\tau$, and we do this in Chapter 4 as the first
step in the derivation of the Planck radiation law.
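
As a numerical illustration of (4) through (6), here is a minimal Python sketch; it is not part of the original text. The function name `two_state_average_energy` is a hypothetical choice, and the sketch assumes the energy and the fundamental temperature are given in the same units.

```python
import math

def two_state_average_energy(eps: float, tau: float) -> float:
    """Thermal average energy of a system with states at energy 0 and eps, Eq. (5)."""
    boltzmann_factor = math.exp(-eps / tau)              # P(eps)/P(0), Eq. (4)
    p_upper = boltzmann_factor / (1.0 + boltzmann_factor)  # uses P(0) + P(eps) = 1, Eq. (6)
    return eps * p_upper                                 # the state at energy 0 contributes nothing

# Sample temperatures, with eps = 1 in the same (arbitrary) units as tau.
for tau in (0.1, 1.0, 10.0):
    print(tau, two_state_average_energy(1.0, tau))
```

At low temperature the average energy tends to 0, and at high temperature it tends to half the level spacing, because the two states become equally probable.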

The most important extension of the theory is to systems that can transfer 
particles as well as energy with the reservoir. For two systems in diffusive and 
thermal contact, the entropy will be a maximum with respect to the transfer 
of particles as well as to the transfer of energy. Not only must $(\partial\sigma/\partial U)_{N,V}$ be
equal for the two systems, but $(\partial\sigma/\partial N)_{U,V}$ must also be equal, where $N$ refers to
the number of particles of a given species. The new equality condition is the
occasion for the introduction of a new quantity, the chemical potential $\mu$:

$$-\frac{\mu}{\tau} \equiv \left(\frac{\partial\sigma}{\partial N}\right)_{U,V}. \tag{7}$$

For two systems in thermal and diffusive contact, $\tau_1 = \tau_2$ and $\mu_1 = \mu_2$. The
sign in (7) is chosen to ensure that the direction of particle flow as equilibrium 
is approached is from high chemical potential to low chemical potential. 

The Gibbs factor of Chapter 5 is an extension of the Boltzmann factor and 
allows us to treat systems that can transfer particles. The simplest example is a 
system with two states, one with 0 particles and 0 energy, and one with 1 particle
and energy $\varepsilon$. The system is in contact with a reservoir at temperature $\tau$ and
chemical potential $\mu$. We extend (3) for the reservoir entropy:

$$\sigma(U_0 - \varepsilon; N_0 - 1) = \sigma(U_0; N_0) - \varepsilon(\partial\sigma/\partial U_0) - 1\cdot(\partial\sigma/\partial N_0) = \sigma(U_0; N_0) - \varepsilon/\tau + \mu/\tau. \tag{8}$$

By analogy with (4), we have

$$P(1,\varepsilon)/P(0,0) = \exp[(\mu - \varepsilon)/\tau], \tag{9}$$

for the ratio of the probability the system is occupied by 1 particle at energy $\varepsilon$
to the probability the system is unoccupied, with energy 0. The result (9) after
normalization is readily expressed as

$$P(1,\varepsilon) = \frac{1}{\exp[(\varepsilon - \mu)/\tau] + 1}. \tag{10}$$

This particular result is known as the Fermi-Dirac distribution function and
is used particularly in the theory of metals to describe the electron gas at low
temperature and high concentration (Chapter 7).
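
The normalization step from (9) to (10) can be written out explicitly. The only added ingredient is the condition that the two probabilities sum to 1, which holds because the system has just these two states:

```latex
\begin{align*}
P(0,0) + P(1,\varepsilon) &= 1, \qquad
\frac{P(1,\varepsilon)}{P(0,0)} = \exp[(\mu-\varepsilon)/\tau] \\
\Rightarrow\quad P(1,\varepsilon)
  &= \frac{\exp[(\mu-\varepsilon)/\tau]}{1+\exp[(\mu-\varepsilon)/\tau]}
   = \frac{1}{\exp[(\varepsilon-\mu)/\tau]+1}.
\end{align*}
```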

The classical distribution function used in the derivation of the ideal gas law 
is just the limit of (10) when the occupancy $P(1,\varepsilon)$ is much less than 1:


$$P(1,\varepsilon) \simeq \exp[(\mu - \varepsilon)/\tau]. \tag{11}$$


The properties of the ideal gas are developed from this result in Chapter 6. 
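
The relation between (10) and its classical limit (11) can be checked numerically. The following Python sketch is an illustration only; the function names `fermi_dirac` and `classical` are hypothetical, and all energies are assumed to be in the same units as the fundamental temperature.

```python
import math

def fermi_dirac(eps: float, mu: float, tau: float) -> float:
    """Occupancy P(1, eps) from Eq. (10)."""
    return 1.0 / (math.exp((eps - mu) / tau) + 1.0)

def classical(eps: float, mu: float, tau: float) -> float:
    """Low-occupancy limit of the occupancy, Eq. (11)."""
    return math.exp((mu - eps) / tau)

# Compare the two forms as (eps - mu)/tau grows and the occupancy becomes small.
for x in (0.0, 1.0, 3.0, 10.0):
    fd = fermi_dirac(x, 0.0, 1.0)
    cl = classical(x, 0.0, 1.0)
    print(x, fd, cl, cl / fd)
```

The ratio of the two expressions is exactly 1 plus the classical occupancy, so the forms agree to the extent that the occupancy is much less than 1.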

The Helmholtz free energy $F = U - \tau\sigma$ appears as an important computational
function, because the relation $(\partial F/\partial\tau)_{N,V} = -\sigma$ offers the easiest method
for finding the entropy, once we have found out how to calculate F from the 
energy eigenvalues (Chapter 3). Other powerful tools for the calculation of 
thermodynamic functions are developed in the text. Most of the remainder of 
the text concerns applications that are useful in their own right and that illumi- 
nate the meaning and utility of the principal thermodynamic functions. 

Thermal physics connects the world of everyday objects, of astronomical 
objects, and of chemical and biological processes with the world of molecular, 
atomic, and electronic systems. It unites the two parts of our world, the micro- 
scopic and the macroscopic. 
Chapter 1: States of a Model System


But although, as a matter of history, statistical mechanics owes its origin to 
investigations in thermodynamics, it seems eminently worthy of an independent 
development, both on account of the elegance and simplicity of its principles, 
and because it yields new results and places old truths in a new light in 
departments quite outside of thermodynamics. 


J. W. Gibbs 


A theory is the more impressive the greater the simplicity of its premises, the 
more different kinds of things it relates, and the more extended its area of 
applicability. Therefore the deep impression that classical thermodynamics made 
upon me. It is the only physical theory of universal content which I am convinced 
will never be overthrown, within the framework of applicability of its basic 
concepts. 


A. Einstein


Thermal physics is the fruit of the union of statistical and mechanical principles.
Mechanics tells us the meaning of work; thermal physics tells us the meaning of 
heat. There are three new quantities in thermal physics that do not appear in 
ordinary mechanics: entropy, temperature, and free energy. We shall motivate 
their definitions in the first three chapters and deduce their consequences 
thereafter. 

Our point of departure for the development of thermal physics is the concept 
of the stationary quantum states of a system of particles. When we can count 
the quantum states accessible to a system, we know the entropy of the system, 
for the entropy is defined as the logarithm of the number of states (Chapter 2). 
The dependence of the entropy on the energy of the system defines the tempera- 
ture. From the entropy, the temperature, and the free energy we find the pressure, 
the chemical potential, and all other thermodynamic properties of the system. 

For a system in a stationary quantum state, all observable physical properties 
such as the energy and the number of particles are independent of the time. For 
brevity we usually omit the word stationary; the quantum states that we treat 
are stationary except when we discuss transport processes in Chapters 14 and 15.
The systems we discuss may be composed of a single particle or, more often, 
of many particles. The theory is developed to handle general systems of inter- 
acting particles, but powerful simplifications can be made in special problems 
for which the interactions may be neglected. 

Each quantum state has a definite energy. States with identical energies are 
said to belong to the same energy level. The multiplicity or degeneracy of an 
energy level is the number of quantum states with very nearly the same energy. 
It is the number of quantum states that is important in thermal physics, not 
the number of energy levels. We shall frequently deal with sums over all quantum 
states. Two states at the same energy must always be counted as two states,
not as one level. 

Let us look at the quantum states and energy levels of several atomic systems. 
The simplest is hydrogen, with one electron and one proton. The low-lying 
energy levels of hydrogen are shown in Figure 1.1. The zero of energy in the 
figure is taken at the state of lowest energy. The number of quantum states 
belonging to the same energy level is in parentheses. In the figure we overlook 
that the proton has a spin of $\tfrac{1}{2}\hbar$ and has two independent orientations, parallel
or antiparallel to the direction of an arbitrary external axis, such as the direction
of a magnetic field. To take account of the two orientations we should double
the values of the multiplicities shown for atomic hydrogen.

Figure 1.1 Low-lying energy levels of atomic hydrogen, lithium, and boron. The
energies are given in electron volts, with 1 eV = $1.602 \times 10^{-12}$ erg. The numbers in
parentheses give the number of quantum states having the same energy, with no account
taken of the spin of the nucleus. The zero of energy in the figure is taken for convenience
at the lowest energy state of each atom.

An atom of lithium has three electrons which move about the nucleus. Each 
electron interacts with the nucleus, and each electron also interacts with all the
other electrons. The energies of the levels of lithium shown in the figure are the
collective energies of the entire system. The energy levels shown for boron, which
has five electrons, are also the energies of the entire system.

Figure 1.2 Energy levels, multiplicities, and quantum numbers
$n_x$, $n_y$, $n_z$ of a particle confined to a cube. (Vertical axis: energy, relative scale;
columns: multiplicity and a representative set of quantum numbers.)

The energy of a system is the total energy of all particles, kinetic plus potential,
with account taken of interactions between particles. A quantum state of the
system is a state of all particles. Quantum states of a one-particle system are
called orbitals. The low-lying energy levels of a single particle of mass $M$ confined
to a cube of side $L$ are shown in Figure 1.2. We shall find in Chapter 3
that an orbital of a free particle can be characterized by three positive integral
quantum numbers $n_x$, $n_y$, $n_z$. The energy is


$$\varepsilon = \frac{\hbar^2}{2M}\left(\frac{\pi}{L}\right)^2\left(n_x^2 + n_y^2 + n_z^2\right). \tag{1}$$


The multiplicities of the levels are indicated in the figure. The three orbitals
with $(n_x, n_y, n_z)$ equal to (4,1,1), (1,4,1), and (1,1,4) all have $n_x^2 + n_y^2 + n_z^2 = 18$;
the corresponding energy level has the multiplicity 3.
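
The multiplicities of the cube levels can be reproduced by brute-force counting. The sketch below is an illustration, not the book's method: the function name `cube_level_multiplicities` and the search limit `n_max` are assumptions introduced here, and levels are labeled by the integer $n_x^2 + n_y^2 + n_z^2$ rather than by the energy itself.

```python
from collections import Counter
from itertools import product

def cube_level_multiplicities(n_max: int) -> dict[int, int]:
    """Map n_x^2 + n_y^2 + n_z^2 to the number of orbitals sharing that value."""
    counts = Counter(
        nx * nx + ny * ny + nz * nz
        for nx, ny, nz in product(range(1, n_max + 1), repeat=3)
    )
    # A level is fully counted only if no quantum number above n_max can reach it;
    # the smallest sum involving n_max + 1 is (n_max + 1)^2 + 1 + 1.
    cutoff = (n_max + 1) ** 2 + 2
    return {level: g for level, g in sorted(counts.items()) if level < cutoff}

# Print (level label, multiplicity) for the low-lying levels.
for level, g in cube_level_multiplicities(4).items():
    print(level, g)
```

The cutoff keeps only levels whose count cannot be changed by quantum numbers outside the searched range.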

To describe the statistical properties of a system of N particles, it is essential 
to know the set of values of the energy $\varepsilon_s(N)$, where $\varepsilon_s(N)$ is the energy of the quantum
state s of the N particle system. Indices such as s may be assigned to the quantum 
states in any convenient arbitrary way, but two different states should not be 
assigned the same index. 

It is a good idea to start our program by studying the properties of simple 
model systems for which the energies $\varepsilon_s(N)$ can be calculated exactly. We choose
as a model a simple binary system because the general statistical properties 
found for the model system are believed to apply equally well to any realistic 
physical system. This assumption leads to predictions that always agree with 
experiment. What general statistical properties are of concern will become clear 
as we go along. 


BINARY MODEL SYSTEMS 


The binary model system is illustrated in Figure 1.3. We assume there are N 
separate and distinct sites fixed in space, shown for convenience on a line. 
Attached to each site is an elementary magnet that can point only up or down, 
corresponding to magnetic moments $\pm m$. To understand the system means to
count the states. This requires no knowledge of magnetism: an element of the 
system can be any site capable of two states, labeled as yes or no, red or blue, 
occupied or unoccupied, zero or one, plus one or minus one. The sites are 
numbered, and sites with different numbers are supposed not to overlap in 
physical space. You might even think of the sites as numbered parking spaces in 
a car parking lot, as in Figure 1.4. Each parking space has two states, vacant or 
occupied by one car. 

Whatever the nature of our objects, we may designate the two states by 
arrows that can only point straight up or straight down. If the magnet points 
up, we say that the magnetic moment is $+m$. If the magnet points down, the
magnetic moment is $-m$.


Figure 1.3 Model system composed of 10 elementary
magnets at fixed sites on a line, each having magnetic
moment $\pm m$. The numbers shown are attached to the sites;
each site has its own magnet. We assume there are no
interactions among the magnets and there is no external
magnetic field. Each magnetic moment may be oriented in
two ways, up or down, so that there are $2^{10}$ distinct
arrangements of the 10 magnetic moments shown in the
figure. If the arrangements are selected in a random process,
the probability of finding the particular arrangement shown
is $1/2^{10}$.


Figure 1.4 State of a parking lot with 10 numbered parking 
spaces. The ●'s denote spaces occupied by a car; the ○'s
denote vacant spaces. This particular state is equivalent to that 
shown in Figure 1.3. 


Now consider N different sites, each of which bears a moment that may 
assume the values $\pm m$. Each moment may be oriented in two ways with a
probability independent of the orientation of all other moments. The total
number of arrangements of the $N$ moments is $2 \times 2 \times 2 \times \cdots \times 2 = 2^N$. A
state of the system is specified by giving the orientation of the moment on each
site; there are $2^N$ states. We may use the following simple notation for a single
state of the system of N sites: 


$$\uparrow\uparrow\downarrow\downarrow\downarrow\uparrow\downarrow\uparrow\uparrow\uparrow\cdots \tag{2}$$
Figure 1.5 The four different states of a
system of two elements numbered 1 and 2,
where each element can have two conditions.
The element is a magnet which can be in
condition $\uparrow$ or condition $\downarrow$.


The sites themselves are assumed to be arranged in a definite order. We may 
number them in sequence from left to right, as we did in Figure 1.3. According 
to this convention the state (2) also can be written as 


$$\uparrow_1\uparrow_2\downarrow_3\downarrow_4\downarrow_5\uparrow_6\downarrow_7\uparrow_8\uparrow_9\uparrow_{10}\cdots \tag{3}$$


Both sets of symbols (2) and (3) denote the same state of the system, the state
in which the magnetic moment on site 1 is $+m$; on site 2, the moment is $+m$;
on site 3, the moment is $-m$; and so forth.

It is not hard to convince yourself that every distinct state of the system is 
contained in a symbolic product of N factors: 


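$$(\uparrow_1 + \downarrow_1)(\uparrow_2 + \downarrow_2)(\uparrow_3 + \downarrow_3)\cdots(\uparrow_N + \downarrow_N). \tag{4}$$

The multiplication rule is defined by

$$(\uparrow_1 + \downarrow_1)(\uparrow_2 + \downarrow_2) = \uparrow_1\uparrow_2 + \uparrow_1\downarrow_2 + \downarrow_1\uparrow_2 + \downarrow_1\downarrow_2. \tag{5}$$


The function (4) on multiplication generates a sum of $2^N$ terms, one for each of
the $2^N$ possible states. Each term is a product of $N$ individual magnetic moment
symbols, with one symbol for each elementary magnet on the line. Each term
denotes an independent state of the system and is a simple product of the form
$\uparrow_1\uparrow_2\downarrow_3\cdots\uparrow_N$, for example.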

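For a system of two elementary magnets, we multiply $(\uparrow_1 + \downarrow_1)$ by $(\uparrow_2 + \downarrow_2)$
to obtain the four possible states of Figure 1.5:


$$(\uparrow_1 + \downarrow_1)(\uparrow_2 + \downarrow_2) = \uparrow_1\uparrow_2 + \uparrow_1\downarrow_2 + \downarrow_1\uparrow_2 + \downarrow_1\downarrow_2. \tag{6}$$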


The sum is not a state but is a way of listing the four possible states of the system. 
The product on the left-hand side of the equation is called a generating function: 
it generates the states of the system. 
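
Expanding the generating function amounts to taking the Cartesian product of the two choices available at each site. A minimal Python sketch under that reading follows; the helper names and the letters `u` and `d` for up and down are illustrative choices, not notation from the text.

```python
from itertools import product

def enumerate_states(n_sites: int) -> list[str]:
    """List every state of n_sites two-state elements, site 1 first."""
    # Each site contributes one factor (u + d); the product of N factors
    # expands into 2**N ordered strings, one per state.
    return ["".join(choice) for choice in product("ud", repeat=n_sites)]

def total_moment(state: str) -> int:
    """Total moment in units of m: +1 per up magnet, -1 per down magnet."""
    return state.count("u") - state.count("d")

states = enumerate_states(2)
print(states)                             # the four terms on the right of Eq. (6)
print([total_moment(s) for s in states])  # total moments in units of m
```

As with the symbolic sum, the list is not itself a state; it is a way of listing all the states.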
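The generating function for the states of a system of three magnets is


$$(\uparrow_1 + \downarrow_1)(\uparrow_2 + \downarrow_2)(\uparrow_3 + \downarrow_3).$$


This expression on multiplication generates $2^3 = 8$ different states:


Three magnets up: $\uparrow_1\uparrow_2\uparrow_3$
Two magnets up: $\uparrow_1\uparrow_2\downarrow_3$, $\uparrow_1\downarrow_2\uparrow_3$, $\downarrow_1\uparrow_2\uparrow_3$
One magnet up: $\uparrow_1\downarrow_2\downarrow_3$, $\downarrow_1\uparrow_2\downarrow_3$, $\downarrow_1\downarrow_2\uparrow_3$
None up: $\downarrow_1\downarrow_2\downarrow_3$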


The total magnetic moment of our model system of N magnets each of 
magnetic moment m will be denoted by M, which we will relate to the energy 
in a magnetic field. The value of $M$ varies from $Nm$ to $-Nm$. The set of possible
values is given by


$$M = Nm,\ (N - 2)m,\ (N - 4)m,\ (N - 6)m,\ \cdots,\ -Nm. \tag{7}$$


The set of possible values of $M$ is obtained if we start with the state for which all
magnets are up ($M = Nm$) and reverse one at a time. We may reverse $N$ magnets
to obtain the ultimate state for which all magnets are down ($M = -Nm$).
There are $N + 1$ possible values of the total moment, whereas there are $2^N$
states. When $N \gg 1$, we have $2^N \gg N + 1$. There are many more states than
values of the total moment. If $N = 10$, there are $2^{10} = 1024$ states distributed
among 11 different values of the total magnetic moment. For large N many 
different states of the system may have the same value of the total moment M. 
We will calculate in the next section how many states have a given value of M. 
Only one state of a system has the moment $M = Nm$; that state is


$$\uparrow\uparrow\uparrow\uparrow\cdots\uparrow\uparrow\uparrow\uparrow. \tag{8}$$


There are $N$ ways to form a state with one magnet down:


$$\downarrow\uparrow\uparrow\uparrow\cdots\uparrow\uparrow\uparrow\uparrow \tag{9}$$


is one such state; another is


$$\uparrow\downarrow\uparrow\uparrow\cdots\uparrow\uparrow\uparrow\uparrow, \tag{10}$$

and the other states with one magnet down are formed from (8) by reversing
any single magnet. The states (9) and (10) have total moment $M = Nm - 2m$.


Enumeration of States and the Multiplicity Function 


We use the word spin as a shorthand for elementary magnet. It is convenient to
assume that $N$ is an even number. We need a mathematical expression for the
number of states with $N_\uparrow = \tfrac{1}{2}N + s$ magnets up and $N_\downarrow = \tfrac{1}{2}N - s$ magnets
down, where $s$ is an integer. When we turn one magnet from the up to the down
orientation, $\tfrac{1}{2}N + s$ goes to $\tfrac{1}{2}N + s - 1$ and $\tfrac{1}{2}N - s$ goes to $\tfrac{1}{2}N - s + 1$.
The difference (number up − number down) changes from $2s$ to $2s - 2$. The
difference


$$N_\uparrow - N_\downarrow = 2s \tag{11}$$


is called the spin excess. The spin excess of the 4 states in Figure 1.5 is 2, 0, 0, −2,
from left to right. The factor of 2 in (11) appears to be a nuisance at this stage,
but it will prove to be convenient.

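The product in (4) may be written symbolically as


$$(\uparrow + \downarrow)^N.$$

We may drop the site labels (the subscripts) from (4) when we are interested
only in how many of the magnets in a state are up or down, and not in which
particular sites have magnets up or down. If we drop the labels and neglect
the order in which the arrows appear in a given product, then (5) becomes


$$(\uparrow + \downarrow)^2 = \uparrow\uparrow + 2\uparrow\downarrow + \downarrow\downarrow;$$

further,

$$(\uparrow + \downarrow)^3 = \uparrow\uparrow\uparrow + 3\uparrow\uparrow\downarrow + 3\uparrow\downarrow\downarrow + \downarrow\downarrow\downarrow.$$

We find $(\uparrow + \downarrow)^N$ for arbitrary $N$ by the binomial expansion

$$(x + y)^N = x^N + Nx^{N-1}y + \tfrac{1}{2}N(N - 1)x^{N-2}y^2 + \cdots + y^N = \sum_{t=0}^{N}\frac{N!}{(N - t)!\,t!}\,x^{N-t}y^t. \tag{12}$$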


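We may write the exponents of $x$ and $y$ in a slightly different, but equivalent,
form by replacing $t$ with $\tfrac{1}{2}N - s$:


$$(x + y)^N = \sum_{s=-\frac{1}{2}N}^{\frac{1}{2}N}\frac{N!}{(\tfrac{1}{2}N + s)!\,(\tfrac{1}{2}N - s)!}\,x^{\frac{1}{2}N+s}\,y^{\frac{1}{2}N-s}. \tag{13}$$

With this result the symbolic expression $(\uparrow + \downarrow)^N$ becomes


$$(\uparrow + \downarrow)^N = \sum_{s}\frac{N!}{(\tfrac{1}{2}N + s)!\,(\tfrac{1}{2}N - s)!}\,\uparrow^{\frac{1}{2}N+s}\,\downarrow^{\frac{1}{2}N-s}. \tag{14}$$

The coefficient of the term in $\uparrow^{\frac{1}{2}N+s}\downarrow^{\frac{1}{2}N-s}$ is the number of states having
$N_\uparrow = \tfrac{1}{2}N + s$ magnets up and $N_\downarrow = \tfrac{1}{2}N - s$ magnets down. This class of
states has spin excess $N_\uparrow - N_\downarrow = 2s$ and net magnetic moment $2sm$. Let us
denote the number of states in this class by $g(N,s)$, for a system of $N$ magnets:


$$g(N,s) = \frac{N!}{(\tfrac{1}{2}N + s)!\,(\tfrac{1}{2}N - s)!} = \frac{N!}{N_\uparrow!\,N_\downarrow!}. \tag{15}$$

Thus (14) is written as

$$(\uparrow + \downarrow)^N = \sum_{s=-\frac{1}{2}N}^{\frac{1}{2}N} g(N,s)\,\uparrow^{\frac{1}{2}N+s}\,\downarrow^{\frac{1}{2}N-s}. \tag{16}$$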


We shall call g(N,s) the multiplicity function; it is the number of states having 
the same value of s. The reason for our definition emerges when a magnetic 
field is applied to the spin system: in a magnetic field, states of different values of 
s have different values of the energy, so that our g is equal to the multiplicity 
of an energy level in a magnetic field. Until we introduce a magnetic field, all 
states of the model system have the same energy, which may be taken as zero. 
Note from (16) that the total number of states is given by

$$\sum_{s=-\frac{1}{2}N}^{\frac{1}{2}N} g(N,s) = (1 + 1)^N = 2^N. \tag{17}$$


Examples related to $g(N,s)$ for $N = 10$ are given in Figures 1.6 and 1.7. For a
coin, “heads” could stand for “magnet up” and “tails” could stand for “magnet
down.”
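
The multiplicity function (15) and the sum rule (17) are easy to tabulate for small $N$. The sketch below uses Python's standard `math.comb` for the binomial coefficient; the function name `multiplicity` and the dictionary `table` are illustrative names introduced here.

```python
from math import comb

def multiplicity(n: int, s: int) -> int:
    """g(N, s) = N! / ((N/2 + s)! (N/2 - s)!), Eq. (15), for even N."""
    if n % 2 != 0:
        raise ValueError("N is assumed even, as in the text")
    if abs(s) > n // 2:
        return 0  # no state has more than N magnets up or down
    return comb(n, n // 2 + s)

N = 10
# Keys are the spin excess 2s; values are the number of states g(N, s).
table = {2 * s: multiplicity(N, s) for s in range(-N // 2, N // 2 + 1)}
print(table)
print(sum(table.values()) == 2**N)  # the sum rule, Eq. (17)
```

The same function applies to the coin picture, with the number of heads playing the role of the number of magnets up.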
Figure 1.6 Number of distinct arrangements
of $5 + s$ spins up and $5 - s$ spins down.
Values of $g(N,s)$ are for $N = 10$, where $2s$ is
the spin excess $N_\uparrow - N_\downarrow$. The total number of
states is

$$2^{10} = \sum_{s=-5}^{5} g(10,s).$$

The values of the $g$'s are taken from a table of
the binomial coefficients. (Horizontal axis: spin excess $2s$, from −10 to 10.)
Binary Alloy System

To illustrate that the exact nature of the two states on each site is irrelevant to
the result, we consider an alternate system: an alloy crystal with $N$ distinct
sites, numbered from 1 through 12 in Figure 1.8. Each site is occupied by either
an atom of chemical species A or an atom of chemical species B, with no provision
for vacant sites. In brass, A could be copper and B zinc. In analogy to (3),
a single state of the alloy system can be written as


$$A_1B_2B_3A_4B_5A_6B_7B_8B_9A_{10}A_{11}A_{12}\cdots \tag{18}$$
Figure 1.7 An experiment was done in which 10 pennies
were thrown 100 times. The number of heads in each
throw was recorded. (Horizontal axis: number of heads.)


Figure 1.8 A binary alloy system of two
chemical components A and B, whose atoms
occupy distinct numbered sites.
Every distinct state of a binary alloy system on $N$ sites is contained in the
symbolic product of $N$ factors:


$$(A_1 + B_1)(A_2 + B_2)(A_3 + B_3)\cdots(A_N + B_N), \tag{19}$$


in analogy to (4). The average composition of a binary alloy is specified
conventionally by the chemical formula $A_{1-x}B_x$, which means that out of a total
of $N$ atoms, the number of A atoms is $N_A = (1 - x)N$ and the number of B
atoms is $N_B = xN$. Here $x$ lies between 0 and 1.

The symbolic expression


$$(A + B)^N = \sum_{t=0}^{N}\frac{N!}{(N - t)!\,t!}\,A^{N-t}B^t \tag{20}$$


is analogous to the result (12). The coefficient of the term in $A^{N-t}B^t$ gives the
number $g(N,t)$ of possible arrangements or states of $N - t$ atoms A and $t$
atoms B on $N$ sites:


$$g(N,t) = \frac{N!}{(N - t)!\,t!} = \frac{N!}{N_A!\,N_B!}, \tag{21}$$

which is identical to the result (15) for the spin model system, except for notation. 


Sharpness of the Multiplicity Function 


We know from common experience that systems held at constant temperature 
usually have well-defined properties; this stability of physical properties is a 
major prediction of thermal physics. The stability follows as a consequence of 
the exceedingly sharp peak in the multiplicity function and of the steep variation 
of that function away from the peak. We can show explicitly that for a very 
large system, the function g(N,s) defined by (15) is peaked very sharply about 
the value s = 0. We look for an approximation that allows us to examine the 
form of $g(N,s)$ versus $s$ when $N \gg 1$ and $|s| \ll N$. We cannot look up these
values in tables: common tables of factorials do not go above $N = 100$, and we
may be interested in $N \sim 10^{20}$, of the order of the number of atoms in a solid
specimen big enough to be seen and felt. An approximation is clearly needed, 
and a good one is available. 

It is convenient to work with $\log g$. Except where otherwise specified, all
logarithms are understood to be log base e, written here as log. The international
standard usage is ln for log base e, but it is clearer to write log when there is no
ambiguity whatever. When you confront a very, very large number such as
$2^N$, where $N = 10^{20}$, it is a simplification to look at the logarithm of the number.
We take the logarithm of both sides of (15) to obtain


$$\log g(N,s) = \log N! - \log(\tfrac{1}{2}N + s)! - \log(\tfrac{1}{2}N - s)!, \tag{22}$$

by virtue of the characteristic property of the logarithm of a product:

$$\log xy = \log x + \log y; \qquad \log(x/y) = \log x - \log y. \tag{23}$$

With the notation

$$N_\uparrow = \tfrac{1}{2}N + s; \qquad N_\downarrow = \tfrac{1}{2}N - s \tag{24}$$

for the number of magnets up and down, (22) appears as

$$\log g(N,s) = \log N! - \log N_\uparrow! - \log N_\downarrow!. \tag{25}$$


We evaluate the logarithm of N! in (25) by use of the Stirling approximation, 
according to which 

$$N! \cong (2\pi N)^{1/2} N^N \exp[-N + 1/(12N) + \cdots] , \tag{26}$$

for $N \gg 1$. This result is derived in Appendix A. For sufficiently large N, the 
terms $1/(12N) + \cdots$ in the argument may be neglected in comparison with N. 
We take the logarithm of both sides of (26) to obtain 

$$\log N! \cong \tfrac{1}{2}\log 2\pi + (N + \tfrac{1}{2})\log N - N . \tag{27}$$

Similarly 

$$\log N_\uparrow! \cong \tfrac{1}{2}\log 2\pi + (N_\uparrow + \tfrac{1}{2})\log N_\uparrow - N_\uparrow ; \tag{28}$$

$$\log N_\downarrow! \cong \tfrac{1}{2}\log 2\pi + (N_\downarrow + \tfrac{1}{2})\log N_\downarrow - N_\downarrow . \tag{29}$$
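The quality of the truncated form (27) can be checked numerically. The following is a minimal sketch, assuming Python's standard `math.lgamma` as the reference value of log N! (since N! = Γ(N + 1)); the function names are illustrative and are not part of the text. The printed error should approach the first neglected term, 1/(12N).

```python
import math

def log_factorial_stirling(n: int) -> float:
    """Truncated Stirling form (27): (1/2) log 2*pi + (n + 1/2) log n - n."""
    return 0.5 * math.log(2 * math.pi) + (n + 0.5) * math.log(n) - n

def log_factorial_exact(n: int) -> float:
    """log n! from the log-gamma function, using n! = Gamma(n + 1)."""
    return math.lgamma(n + 1)

for n in (10, 100, 1000):
    err = log_factorial_exact(n) - log_factorial_stirling(n)
    # Compare the error with the first neglected term 1/(12 n) of (26).
    print(n, err, 1 / (12 * n))
```

Even for N = 10 the neglected terms are already below one percent of log N!, which is why they may be dropped for the very large N of interest here.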


After rearrangement of (27), 

$$\log N! \cong \tfrac{1}{2}\log(2\pi/N) + (N_\uparrow + \tfrac{1}{2} + N_\downarrow + \tfrac{1}{2})\log N - (N_\uparrow + N_\downarrow) , \tag{30}$$

where we have used $N = N_\uparrow + N_\downarrow$. We subtract (28) and (29) from (30) to obtain 
for (25): 

$$\log g \cong \tfrac{1}{2}\log(1/2\pi N) - (N_\uparrow + \tfrac{1}{2})\log(N_\uparrow/N) - (N_\downarrow + \tfrac{1}{2})\log(N_\downarrow/N) . \tag{31}$$

This may be simplified because 

$$\log(N_\uparrow/N) = \log\tfrac{1}{2}(1 + 2s/N) = -\log 2 + \log(1 + 2s/N) \cong -\log 2 + (2s/N) - (2s^2/N^2) \tag{32}$$

by virtue of the expansion $\log(1 + x) = x - \tfrac{1}{2}x^2 + \cdots$, valid for $x \ll 1$. 
Similarly, 

$$\log(N_\downarrow/N) = \log\tfrac{1}{2}(1 - 2s/N) \cong -\log 2 - (2s/N) - (2s^2/N^2) . \tag{33}$$

On substitution in (31) we obtain 

$$\log g \cong \tfrac{1}{2}\log(2/\pi N) + N\log 2 - 2s^2/N . \tag{34}$$

We write this result as 

$$g(N,s) \cong g(N,0)\exp(-2s^2/N) , \tag{35}$$

where 

$$g(N,0) \cong (2/\pi N)^{1/2}\, 2^N . \tag{36}$$


Such a distribution of values of s is called a Gaussian distribution. The integral* 
of (35) over the range $-\infty$ to $+\infty$ for s gives the correct value $2^N$ for the total 
number of states. Several useful integrals are treated in Appendix A. 

The exact value of g(N,0) is given by (15) with s = 0: 

$$g(N,0) = \frac{N!}{(\tfrac{1}{2}N)!\,(\tfrac{1}{2}N)!} . \tag{37}$$

* The replacement of a sum by an integral, such as $\sum(\ldots)$ by $\int(\ldots)\,ds$, usually does not introduce 
significant errors. For example, the ratio of 

$$\sum_{s=0}^{N} s = \tfrac{1}{2}(N^2 + N) \quad \text{to} \quad \int_0^N s\,ds = \tfrac{1}{2}N^2$$

is equal to 1 + (1/N), which approaches 1 as N approaches $\infty$. 


For N = 50, the value of g(50,0) is $1.264 \times 10^{14}$, from (37). The approximate 
value from (36) is $1.270 \times 10^{14}$. The distribution plotted in Figure 1.9 is centered 
in a maximum at s = 0. When $s^2 = \tfrac{1}{2}N$, the value of g is reduced to $e^{-1}$ of the 
maximum value. That is, when 

$$s/N = (1/2N)^{1/2} , \tag{38}$$

the value of g is $e^{-1}$ of g(N,0). The quantity $(1/2N)^{1/2}$ is thus a reasonable measure 
of the fractional width of the distribution. For $N \approx 10^{22}$, the fractional 
width is of the order of $10^{-11}$. When N is very large, the distribution is exceedingly 
sharply defined, in a relative sense. It is this sharp peak and the continued 
sharp variation of the multiplicity function far from the peak that will lead to a 
prediction that the physical properties of systems in thermal equilibrium are 
well defined. We now consider one such property, the mean value of $s^2$. 
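The numbers quoted for N = 50 can be reproduced directly. The sketch below is illustrative only: the helper names `g_exact` and `g_gauss` are our own, and the exact multiplicity is taken from the binomial coefficient in Python's standard library. It compares (37) with (36), confirms the 1/e point of (38), and checks that the Gaussian form sums to nearly $2^N$.

```python
import math

def g_exact(n: int, s: int) -> int:
    """Exact multiplicity (15) for even n: n! / ((n/2 + s)! (n/2 - s)!)."""
    return math.comb(n, n // 2 + s)

def g_gauss(n: int, s: float) -> float:
    """Gaussian approximation (35) with the prefactor (36)."""
    return math.sqrt(2 / (math.pi * n)) * 2**n * math.exp(-2 * s**2 / n)

n = 50
print(g_exact(n, 0), g_gauss(n, 0))            # exact value versus approximation

s_width = math.sqrt(n / 2)                     # the point where s**2 = n/2
print(g_gauss(n, s_width) / g_gauss(n, 0))     # ratio at that point, equal to 1/e

# Sum of the Gaussian form over all allowed s, relative to the exact total 2**n.
total = sum(g_gauss(n, s) for s in range(-n // 2, n // 2 + 1))
print(total / 2**n)
```

The approximation is applied here well outside the region $|s| \ll N$, yet the total is still very close to $2^N$ because the wings contribute almost nothing.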


Figure 1.9 The Gaussian approximation to 
the binomial coefficients g(100,s) plotted on a 
linear scale. On this scale it is not possible to 
distinguish on the drawing the approximation 
from the exact values over the range of s 
plotted. The entire range of s is from —50 to 
+50. The dashed lines are drawn from the 
points at 1/e of the maximum value of g. 

AVERAGE VALUES 


The average value, or mean value, of a function f(s) taken over a probability 
distribution function P(s) is defined as 

$$\langle f \rangle = \sum_s f(s) P(s) , \tag{39}$$

provided that the distribution function is normalized to unity: 

$$\sum_s P(s) = 1 . \tag{40}$$

The binomial distribution (15) has the property (17) that 

$$\sum_s g(N,s) = 2^N , \tag{41}$$

and is not normalized to unity. If all states are equally probable, then 
$P(s) = g(N,s)/2^N$, and we have $\sum P(s) = 1$. The average of f(s) over this distribution 
will be 

$$\langle f \rangle = \sum_s f(s) P(N,s) . \tag{42}$$


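Consider the function $f(s) = s^2$. In the approximation that led to (35) and 
(36), we replace in (42) the sum $\sum$ over s by an integral $\int \cdots ds$ between $-\infty$ 
and $+\infty$. Then 

$$\langle s^2 \rangle = \frac{(2/\pi N)^{1/2}\, 2^N \int ds\, s^2 \exp(-2s^2/N)}{2^N} = (2/\pi N)^{1/2} (N/2)^{3/2} \int_{-\infty}^{\infty} dx\, x^2 e^{-x^2} = (2/\pi N)^{1/2} (N/2)^{3/2} (\pi/4)^{1/2} ,$$

whence 

$$\langle s^2 \rangle = \tfrac{1}{4}N; \qquad \langle (2s)^2 \rangle = N . \tag{43}$$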


The quantity $\langle (2s)^2 \rangle$ is the mean square spin excess. The root mean square 
spin excess is 

$$\langle (2s)^2 \rangle^{1/2} = \sqrt{N} , \tag{44}$$

and the fractional fluctuation in 2s is defined as 

$$\mathcal{F} \equiv \frac{\langle (2s)^2 \rangle^{1/2}}{N} = \frac{1}{\sqrt{N}} . \tag{45}$$

The larger N is, the smaller is the fractional fluctuation. This means that the 
central peak of the distribution function becomes relatively more sharply 
defined as the size of the system increases, the size being measured by the 
number of sites N. For $10^{20}$ particles, $\mathcal{F} = 10^{-10}$, which is very small. 
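The result (43) was obtained in the Gaussian approximation, but it can be compared with the exact average over the binomial distribution (15). The following sketch is an illustration under our own naming (`mean_s_squared` is not from the text); it uses exact rational arithmetic so that no rounding enters the comparison.

```python
from fractions import Fraction
from math import comb

def mean_s_squared(n: int) -> Fraction:
    """Exact <s^2> = sum_s s^2 g(n, s) / 2^n over the binomial distribution (even n)."""
    half = n // 2
    total = sum(s * s * comb(n, half + s) for s in range(-half, half + 1))
    return Fraction(total, 2**n)

for n in (10, 100, 1000):
    s2 = mean_s_squared(n)
    rms_spin_excess = float(4 * s2) ** 0.5     # <(2s)^2>^(1/2)
    # Last column is the fractional fluctuation of 2s, to be compared with 1/sqrt(n).
    print(n, s2, rms_spin_excess / n)
```

For this model the exact average coincides with N/4, so the fractional fluctuation falls as $1/\sqrt{N}$ at every even N, not only in the large-N limit.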
The thermal properties of the model system become physically relevant when 
the elementary magnets are placed in a magnetic field, for then the energies of 
the different states are no longer all equal. If the energy of the system is specified, 
then only the states having this energy may occur. The energy of interaction 
of a single magnetic moment m with a fixed external magnetic field B is 


$$U = -\mathbf{m} \cdot \mathbf{B} . \tag{46}$$


This is the potential energy of the magnet m in the field B. 
For the model system of N elementary magnets, each with two allowed 
orientations in a uniform magnetic field B, the total potential energy U is 


$$U = \sum_{i=1}^{N} U_i = -\mathbf{B} \cdot \sum_{i=1}^{N} \mathbf{m}_i = -2smB = -MB , \tag{47}$$


using the expression M for the total magnetic moment 2sm. In this example the 
spectrum of values of the energy U is discrete. We shall see later that a con- 
tinuous or quasi-continuous spectrum will create no difficulty. Furthermore, 
the spacing between adjacent energy levels of this model is constant, as in 
Figure 1.10. Constant spacing is a special feature of the particular model, but 
this feature will not restrict the generality of the argument that is developed in 
the following sections. 

The value of the energy for moments that interact only with the external 
magnetic field is completely determined by the value of s. This functional 
dependence is indicated by writing U(s). Reversing a single moment from up to down changes 
2s by −2, changes the total magnetic moment by −2m, and raises the energy 
by 2mB. The energy difference between adjacent levels is denoted by $\Delta\varepsilon$, where 


$$\Delta\varepsilon = U(s) - U(s + 1) = 2mB . \tag{48}$$

[Figure 1.10: energy-level diagram with columns s, U(s)/mB, g(s), and log g(s).] 

Figure 1.10 Energy levels of the model system of 10 
magnetic moments m in a magnetic field B. The levels 
are labeled by their s values, where 2s is the spin excess 
and $\tfrac{1}{2}N + s = 5 + s$ is the number of up spins. The 
energies U(s) and multiplicities g(s) are shown. For this 
problem the energy levels are spaced equally, with 
separation $\Delta\varepsilon = 2mB$ between adjacent levels. 
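The columns of Figure 1.10 follow from $U(s) = -2smB$ and the multiplicity function (15). As a hedged illustration (the function `level_table` is our own construction, not part of the text), they can be tabulated for N = 10 as follows.

```python
import math

def level_table(n: int):
    """Rows (s, U/mB, g, log g) for n spins, from U(s) = -2 s m B and g(n, s)."""
    half = n // 2
    rows = []
    for s in range(-half, half + 1):
        g = math.comb(n, half + s)             # multiplicity of the level
        rows.append((s, -2 * s, g, math.log(g)))
    return rows

for s, u_over_mb, g, log_g in level_table(10):
    print(f"{s:+3d} {u_over_mb:+4d} {g:4d} {log_g:6.3f}")
```

Adjacent rows differ by 2 in U/mB, which is the constant spacing $2mB$ of (48).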
Example: Multiplicity function for harmonic oscillators. The problem of the binary model 
system is the simplest problem for which an exact solution for the multiplicity function is 
known. Another exactly solvable problem is the harmonic oscillator, for which the solution 
was originally given by Max Planck. The original derivation is often felt to be not entirely 
simple. The beginning student need not worry about this derivation. The modern way to 
do the problem is given in Chapter 4 and is simple. 

The quantum states of a harmonic oscillator have the energy eigenvalues 


$$\varepsilon_s = s\hbar\omega , \tag{49}$$

where the quantum number s is a positive integer or zero, and $\omega$ is the angular frequency of 
the oscillator. The number of states is infinite, and the multiplicity of each is one. Now 
consider a system of N such oscillators, all of the same frequency. We want to find the 
number of ways in which a given total excitation energy 

$$\varepsilon = \sum_{i=1}^{N} s_i \hbar\omega = n\hbar\omega \tag{50}$$

can be distributed among the oscillators. That is, we want the multiplicity function g(N,n) 
for the N oscillators. The oscillator multiplicity function is not the same as the spin multi- 
plicity function found earlier. 

We begin the analysis by going back to the multiplicity function for a single oscillator, 
for which g(1,n) = 1 for all values of the quantum number s, here identical to n. To solve the 
problem of (53) below, we need a function to represent or generate the series 


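$$\sum_{n=0}^{\infty} g(1,n)\, t^n = \sum_{n=0}^{\infty} t^n . \tag{51}$$

Here t is just a temporary tool that will help us find the result (53), but t does not appear 
in the final result. The answer is 

$$\frac{1}{1 - t} = \sum_{n=0}^{\infty} t^n , \tag{52}$$

provided we assume |t| < 1. For the problem of N oscillators, the generating function is 

$$\left(\frac{1}{1 - t}\right)^N = \left(\sum_{s=0}^{\infty} t^s\right)^N = \sum_{n=0}^{\infty} g(N,n)\, t^n , \tag{53}$$

because the number of ways a term $t^n$ can appear in the N-fold product is precisely the 
number of ordered ways in which the integer n can be formed as the sum of N non-negative 
integers. 

We observe that 

$$g(N,n) = \lim_{t \to 0} \frac{1}{n!}\left(\frac{d}{dt}\right)^n \sum_{s=0}^{\infty} g(N,s)\, t^s = \lim_{t \to 0} \frac{1}{n!}\left(\frac{d}{dt}\right)^n (1 - t)^{-N} = \frac{1}{n!}\, N(N + 1)(N + 2)\cdots(N + n - 1) . \tag{54}$$

Thus for the system of oscillators, 

$$g(N,n) = \frac{(N + n - 1)!}{n!\,(N - 1)!} . \tag{55}$$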


This result will be needed in solving a problem in the next chapter. 
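The generating-function argument can be checked by carrying out the N-fold product (53) explicitly on truncated series and reading off the coefficients. The sketch below is only an illustration; the function names and the truncation order `n_max` are our own assumptions.

```python
from math import comb

def oscillator_multiplicities(n_osc: int, n_max: int) -> list[int]:
    """Coefficients of t^0 .. t^n_max in (1 + t + t^2 + ...)^n_osc."""
    coeffs = [1] + [0] * n_max                 # the series "1" (no oscillators yet)
    for _ in range(n_osc):
        # Multiplying a series by 1/(1 - t) replaces each coefficient
        # by the running sum of the coefficients up to that order.
        running = 0
        for n in range(n_max + 1):
            running += coeffs[n]
            coeffs[n] = running
    return coeffs

def g_formula(n_osc: int, n: int) -> int:
    """Closed form (55): (N + n - 1)! / (n! (N - 1)!)."""
    return comb(n_osc + n - 1, n)

print(oscillator_multiplicities(3, 4))
print([g_formula(3, n) for n in range(5)])
```

Both lines should list the same multiplicities, since the coefficient of $t^n$ in the product is g(N,n) by construction.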
SUMMARY 


1. The multiplicity function for a system of N magnets with spin excess $2s = N_\uparrow - N_\downarrow$ is 

$$g(N,s) = \frac{N!}{(\tfrac{1}{2}N + s)!\,(\tfrac{1}{2}N - s)!} = \frac{N!}{N_\uparrow!\, N_\downarrow!} .$$

In the limit $s/N \ll 1$, with $N \gg 1$, we have the Gaussian approximation 

$$g(N,s) \cong (2/\pi N)^{1/2}\, 2^N \exp(-2s^2/N) .$$

2. If all states of the model spin system are equally likely, the average value of 
$s^2$ is 

$$\langle s^2 \rangle = \frac{\int_{-\infty}^{\infty} ds\, s^2\, g(N,s)}{\int_{-\infty}^{\infty} ds\, g(N,s)} = \tfrac{1}{4}N ,$$

in the Gaussian approximation. 


3. The fractional fluctuation of $s^2$ is defined as $\langle s^2 \rangle^{1/2}/N$ and is equal to 
$1/(2N^{1/2})$. 


4. The energy of the model spin system in a state of spin excess 2s is 
U(s) = —2smB , 


where m is the magnetic moment of one spin and B is the magnetic field. 

Note on problems: The method of this chapter can be used to solve some problems, as illustrated 
by Problems 1, 2, and 3. Because much simpler methods are developed in Chapter 3 and later, 
we do not emphasize problem solving at this stage. 

One should not imagine that two gases in a 0.1 liter container, initially unmixed, 
will mix, then again after a few days separate, then mix again, and so forth. On 
the contrary, one finds . . . that not until a time enormously long compared to 
$10^{10^{10}}$ years will there be any noticeable unmixing of the gases. One may 
recognize that this is practically equivalent to never. . . . 


L. Boltzmann 


If we wish to find in rational mechanics an a priori foundation for the principles 
of thermodynamics, we must seek mechanical definitions of temperature and 
entropy. 

J. W. Gibbs 


The general connection between energy and temperature may only be established 
by probability considerations. [Two systems] are in statistical equilibrium when 
a transfer of energy does not increase the probability. 


M. Planck 




We start this chapter with a definition of probability that enables us to 
define the average value of a physical property of a system. We then consider 
systems in thermal equilibrium, the definition of entropy, and the definition of 
temperature. The second law of thermodynamics will appear as the law of 
increase of entropy. This chapter is perhaps the most abstract in the book. The 
chapters that follow will apply the concepts to physical problems. 


FUNDAMENTAL ASSUMPTION 


The fundamental assumption of thermal physics is that a closed system is equally 
likely to be in any of the quantum states accessible to it. All accessible quantum 
states are assumed to be equally probable—there is no reason to prefer some 
accessible states over other accessible states. 

A closed system will have constant energy, a constant number of particles, 
constant volume, and constant values of all external parameters that may 
influence the system, including gravitational, electric, and magnetic fields. 

A quantum state is accessible if its properties are compatible with the physical 
specification of the system: the energy of the state must be in the range within 
which the energy of the system is specified, and the number of particles must be 
in the range within which the number of particles is specified. With large systems 
we can never know either of these exactly, but it will suffice to have $\delta U/U \ll 1$ 
and $\delta N/N \ll 1$. 

Unusual properties of a system may sometimes make it impossible for 
certain states to be accessible during the time the system is under observation. 
For example, the states of the crystalline form of SiO$_2$ are inaccessible at low 
temperatures in any observation that starts with the glassy or amorphous 
form: fused silica will not convert to quartz in our lifetime in a low-temperature 
experiment. You will recognize many exclusions of this type by common sense. 
We treat all quantum states as accessible unless they are excluded by the 
specification of the system (Figure 2.1) and the time scale of the measurement 
process. States that are not accessible are said to have zero probability. 

Of course, it is possible to specify the configuration of a closed system to a 
point that its statistical properties as such are of no interest. If we specify that the 
system is exactly in a stationary quantum state s, no statistical aspect is left in 
the problem. 

Figure 2.1 A purely symbolic diagram: each solid spot 
represents an accessible quantum state of a closed system. The 
fundamental assumption of statistical physics is that a closed 
system is equally likely to be in any of the quantum states 
accessible to it. The empty circles represent some of the states 
that are not accessible because their properties do not satisfy 
the specification of the system. 


PROBABILITY 


Suppose we have a closed system that we know is equally likely to be in any 
of the g accessible quantum states. Let s be a general state label (and not one-half 
the spin excess). The probability P(s) of finding the system in this state is 


$$P(s) = 1/g \tag{1}$$


if the state s is accessible and P(s) = 0 otherwise, consistent with the fun- 
damental assumption. We shall be concerned later with systems that are not 
closed, for which the energy U and particle number N may vary. For these 
systems P(s) will not be a constant as in (1), but will have a functional dependence 
on U and on N. 




The sum $\sum P(s)$ of the probability over all states is always equal to unity, 
because the total probability that the system is in some state is unity: 

$$\sum_s P(s) = 1 . \tag{2}$$


The probabilities defined by (1) lead to the definition of the average value of 
any physical property. Suppose that the physical property X has the value 
X(s) when the system is in the state s. Here X might denote magnetic moment, 
energy, square of the energy, charge density near a point r, or any property that 
can be observed when the system is in a quantum state. Then the average of the 
observations of the quantity X taken over a system described by the probabilities 
P(s) is 

$$\langle X \rangle = \sum_s X(s) P(s) . \tag{3}$$

This equation defines the average value of X. Here P(s) is the probability that 
the system is in the state s. The angular brackets $\langle \cdots \rangle$ are used to denote 
average value. 

For a closed system, the average value of X is 

$$\langle X \rangle = \sum_s X(s)(1/g) , \tag{4}$$


because now all g accessible states are equally likely, with P(s) = 1/g. The 
average in (4) is an elementary example of what may be called an ensemble 
average: we imagine g similar systems, one in each accessible quantum state. 
Such a group of systems constructed alike is called an ensemble of systems. The 
average of any property over the group is called the ensemble average of that 
property. 

An ensemble of systems is composed of many systems, all constructed alike. 
Each system in the ensemble is a replica of the actual system in one of the 
quantum states accessible to the system. If there are g accessible states, then 
there will be g systems in the ensemble, one system for each state. Each system 
in the ensemble is equivalent for all practical purposes to the actual system. 
Each system satisfies all external requirements placed on the original system 
and in this sense is “just as good” as the actual system. Every quantum state 
accessible to the actual system is represented in the ensemble by one system in a 
stationary quantum state, as in Figure 2.2. We assume that the ensemble 
represents the real system; this is implied in the fundamental assumption. 

Figure 2.2 This ensemble a through j represents a system 
of 10 spins with energy −8mB and spin excess 2s = 8. 
The multiplicity g(N,s) is g(10,4) = 10, so that the 
representative ensemble must contain 10 systems. The 
order in which the various systems in the ensemble are 
listed has no significance. 
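The ensemble average (4) can be made concrete for the ensemble of Figure 2.2. The following is a minimal sketch under our own conventions: a spin is represented by +1 (up) or −1 (down), and the helper names `ensemble` and `ensemble_average` are hypothetical. It builds the 10 systems with spin excess 2s = 8 and averages two properties over them with equal weight 1/g.

```python
from fractions import Fraction
from itertools import product

def ensemble(n_spins: int, spin_excess: int):
    """All spin states (+1 up, -1 down) whose sum equals the spin excess 2s."""
    return [state for state in product((+1, -1), repeat=n_spins)
            if sum(state) == spin_excess]

def ensemble_average(systems, prop) -> Fraction:
    """Equation (4): each of the g systems in the ensemble has weight 1/g."""
    return Fraction(sum(prop(state) for state in systems), len(systems))

systems = ensemble(10, 8)
print(len(systems))                                    # number of systems, g(10,4)
print(ensemble_average(systems, lambda st: st[0]))     # mean orientation of one chosen spin
print(ensemble_average(systems, lambda st: sum(st)))   # mean spin excess 2s
```

The spin excess is the same in every system, so its average is sharp, whereas the orientation of a single chosen spin is reversed in exactly one of the ten systems.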


Example: Construction of an ensemble. We construct in Figure 2.3 an ensemble to 
represent a closed system of five spins, each system with spin excess 2s = 1. The energy of 
each in a magnetic field is identical. (Do not confuse the use of s in spin excess with our 
frequent use of s as a state index or label.) Each system represents one of the multiples of 
quantum states at this energy. The number of such states is given by the multiplicity function 
(1.15): 

$$g(5,\tfrac{1}{2}) = \frac{5!}{3!\,2!} = 10 .$$

The 10 systems shown in Figure 2.3 make up the ensemble. 

Figure 2.3 The ensemble represents a system with N = 5 spins and spin excess 2s = 1. 

Figure 2.4 With N = 5 and 2s = 5, a single system may represent the ensemble. This is not 
a typical situation. 

If the energy in the magnetic field were such that 2s = 5, then a single system comprises 
the ensemble, as in Figure 2.4. In zero magnetic field, all energies of all $2^N = 2^5 = 32$ states 
are equal, and the new ensemble must represent 32 systems, of which 1 system has 2s = 5; 
5 systems have 2s = 3; 10 systems have 2s = 1; 10 systems have 2s = −1; 5 systems 
have 2s = −3; and 1 system has 2s = −5. 
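The census of the zero-field ensemble can be verified by listing all 32 states of five spins and sorting them by spin excess. This is an illustrative sketch; the +1/−1 encoding of up and down spins and the name `count_by_spin_excess` are our own choices.

```python
from collections import Counter
from itertools import product

def count_by_spin_excess(n_spins: int) -> dict[int, int]:
    """Number of states at each value of the spin excess 2s, from full enumeration."""
    counts = Counter(sum(state) for state in product((+1, -1), repeat=n_spins))
    return dict(sorted(counts.items(), reverse=True))

for spin_excess, n_systems in count_by_spin_excess(5).items():
    print(f"2s = {spin_excess:+d}: {n_systems} systems")
```

The counts are the binomial coefficients of (1.15), and they add up to $2^5 = 32$.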


Most Probable Configuration 


Let two systems $\mathcal{S}_1$ and $\mathcal{S}_2$ be brought into contact so that energy can be 
transferred freely from one to the other. This is called thermal contact (Figure 
2.5). The two systems in contact form a larger closed system $\mathcal{S} = \mathcal{S}_1 + \mathcal{S}_2$ 
with constant energy $U = U_1 + U_2$. What determines whether there will be a 
net flow of energy from one system to another? The answer leads to the concept 
of temperature. The direction of energy flow is not simply a matter of whether 
the energy of one system is greater than the energy of the other, because the 
systems can be different in size and constitution. A constant total energy can be 
shared in many ways between two systems. 

Figure 2.5 Establishment of thermal contact between two systems $\mathcal{S}_1$ and $\mathcal{S}_2$. 
[Upper panel: two closed systems not in contact, separated by insulation, with energies 
$U_1$, $U_2$ and particle numbers $N_1$, $N_2$. Lower panel: the systems are in thermal contact; 
a thermal conductor allows exchange of energy, and the total energy $U_1 + U_2$ is unchanged.] 

The most probable division of the total energy is that for which the combined 
system has the maximum number of accessible states. We shall enumerate the 
accessible states of two model systems and then study what characterizes the 
systems when in thermal contact. We first solve in detail the problem of thermal 
contact between two spin systems, 1 and 2, in a magnetic field which is introduced 
in order to define the energy. The numbers of spins $N_1$, $N_2$ may be different, and 
the values of the spin excess $2s_1$, $2s_2$ may be different for the two systems. All 
spins have magnetic moment m. The actual exchange of energy might take place 
via some weak (residual) coupling between the spins near the interface between 
the two systems. We assume that the quantum states of the total system $\mathcal{S}$ can 
be represented accurately by a combination of any state of $\mathcal{S}_1$ with any state of 
$\mathcal{S}_2$. We keep $N_1$, $N_2$ constant, but the values of the spin excess are allowed to 
change. The spin excess of a state of the combined system will be denoted by 2s, 
where $s = s_1 + s_2$. The energy of the combined system is directly proportional 
to the total spin excess: 

$$U(s) = U_1(s_1) + U_2(s_2) = -2mB(s_1 + s_2) = -2mBs . \tag{5}$$

The total number of particles is $N = N_1 + N_2$. 




We assume that the energy splittings between adjacent energy levels are equal 
to 2mB in both systems, so that the magnetic energy given up by system 1 when 
one spin is reversed can be taken up by the reversal of one spin of system 2 in 
the opposite sense. Any large physical system will have enough diverse modes of 
energy storage so that energy exchange with another system is always possible. 
The value of $s = s_1 + s_2$ is constant because the total energy is constant, but 
when the two systems are brought into thermal contact a redistribution is 
permitted in the separate values of $s_1$, $s_2$ and thus in the energies $U_1$, $U_2$. 

The multiplicity function g(N,s) of the combined system $\mathcal{S}$ is related to the 
product of the multiplicity functions of the individual systems $\mathcal{S}_1$ and $\mathcal{S}_2$ by 
the relation: 

$$g(N,s) = \sum_{s_1} g_1(N_1,s_1)\, g_2(N_2, s - s_1) , \tag{6}$$

where the multiplicity functions $g_1$, $g_2$ are given by expressions of the form of 
(1.15). The range of $s_1$ in the summation is from $-\tfrac{1}{2}N_1$ to $\tfrac{1}{2}N_1$, if $N_1 < N_2$. 
To see how (6) comes about, consider first that configuration of the combined 
system for which the first system has spin excess $2s_1$ and the second system has 
spin excess $2s_2$. A configuration is defined as the set of all states with specified 
values of $s_1$ and $s_2$. The first system has $g_1(N_1,s_1)$ accessible states, each of which 
may occur together with any of the $g_2(N_2,s_2)$ accessible states of the second 
system. The total number of states in one configuration of the combined system 
is given by the product $g_1(N_1,s_1)\,g_2(N_2,s_2)$ of the multiplicity functions of $\mathcal{S}_1$ 
and $\mathcal{S}_2$. Because $s_2 = s - s_1$, the product of the g’s may be written as 

$$g_1(N_1,s_1)\, g_2(N_2, s - s_1) . \tag{7}$$


This product forms one term of the sum (6). 

Different configurations of the combined system are characterized by different 
values of $s_1$. We sum over all possible values of $s_1$ to obtain the total number of 
states of all the configurations with fixed s or fixed energy. We thus obtain (6), 
where g(N,s) is the number of accessible states of the combined system. In the 
sum we hold s, $N_1$, and $N_2$ constant, as part of the specification of thermal 
contact. 
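The sum rule (6) can be tested on small systems, where every term is an exact binomial coefficient. The sketch below is an illustration with names of our own choosing (`g`, `g_combined`); the system sizes $N_1 = 10$, $N_2 = 20$ and the value s = 3 are arbitrary.

```python
from math import comb

def g(n: int, s: int) -> int:
    """Spin multiplicity (1.15) for even n; zero when |s| exceeds n/2."""
    k = n // 2 + s                             # number of up spins
    return comb(n, k) if 0 <= k <= n else 0

def g_combined(n1: int, n2: int, s: int) -> int:
    """Right-hand side of (6): sum over s1 of g1(N1, s1) g2(N2, s - s1)."""
    return sum(g(n1, s1) * g(n2, s - s1) for s1 in range(-n1 // 2, n1 // 2 + 1))

n1, n2, s = 10, 20, 3
print(g_combined(n1, n2, s), g(n1 + n2, s))
```

The two printed numbers should agree: counting the states of the combined system configuration by configuration gives the same total as applying (1.15) directly to $N = N_1 + N_2$ spins.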

The result (6) is a sum of products of the form (7). Such a product will be a 
maximum for some value of $s_1$, say $\hat{s}_1$, to be read as “$s_1$ hat” or “$s_1$ caret”. 
The configuration for which $g_1g_2$ is a maximum is called the most probable 
configuration; the number of states in it is 

$$g_1(N_1,\hat{s}_1)\, g_2(N_2, s - \hat{s}_1) . \tag{8}$$

Figure 2.6 Schematic representation of the dependence of the 
configuration multiplicity on the division of the total energy 
between two systems, $\mathcal{S}_1$ and $\mathcal{S}_2$. 


If the systems are large, the maximum with respect to changes in $s_1$ will be 
extremely sharp, as in Figure 2.6. A relatively small number of configurations 
will dominate the statistical properties of the combined system. The most 
probable configuration alone will describe many of these properties. 

Such a sharp maximum is a property of every realistic type of large system 
for which exact solutions are available; we postulate that it is a general property 
of all large systems. From the sharpness property it follows that fluctuations 
about the most probable configuration are small, in a sense that we will define. 

The important result follows that the values of the average physical properties 
of a large system in thermal contact with another large system are accurately 
described by the properties of the most probable configuration, the configura- 
tion for which the number of accessible states is a maximum. Such average 
values (used in either of these two senses) are called thermal equilibrium values. 

Because of the sharp maximum, we may replace the average of a physical 
quantity over all accessible configurations (6) by an average over only the most 
probable configuration (8). In the example below we estimate the error involved 
in such a replacement and find the error to be negligible. 




Example: Two spin systems in thermal contact. We investigate for the model spin system 
the sharpness of the product (7) near the maximum (8) as follows. We form the product of 
the multiplicity functions for $g_1(N_1,s_1)$ and $g_2(N_2,s_2)$, both of the form of (1.35): 

$$g_1(N_1,s_1)\, g_2(N_2,s_2) = g_1(0)\, g_2(0) \exp\left(-\frac{2s_1^2}{N_1} - \frac{2s_2^2}{N_2}\right) , \tag{9}$$

where $g_1(0)$ denotes $g_1(N_1,0)$ and $g_2(0)$ denotes $g_2(N_2,0)$. We replace $s_2$ by $s - s_1$: 

$$g_1(N_1,s_1)\, g_2(N_2, s - s_1) = g_1(0)\, g_2(0) \exp\left(-\frac{2s_1^2}{N_1} - \frac{2(s - s_1)^2}{N_2}\right) . \tag{10}$$

This product* gives the number of states accessible to the combined system when the spin 
excess of the combined system is 2s, and the spin excess of the first system is $2s_1$. 

We find the maximum value of (10) as a function of $s_1$ when the total spin excess 2s is held 
constant; that is, when the energy of the combined systems is constant. It is convenient 
to use the property that the maximum of log y(x) occurs at the same value of x as the 
maximum of y(x). The calculation can be done either way. From (10), 

$$\log g_1(N_1,s_1)\, g_2(N_2, s - s_1) = \log g_1(0) g_2(0) - \frac{2s_1^2}{N_1} - \frac{2(s - s_1)^2}{N_2} . \tag{11}$$


This quantity is an extremum when the first derivative with respect to $s_1$ is zero. An extremum 
may be a maximum, a minimum, or a point of inflection. The extremum is a 
maximum if the second derivative of the function is negative, so that the curve bends 
downward. 

At the extremum the first derivative is 

$$\frac{\partial}{\partial s_1}\{\log g_1(N_1,s_1)\, g_2(N_2, s - s_1)\} = -\frac{4s_1}{N_1} + \frac{4(s - s_1)}{N_2} = 0 , \tag{12}$$

where $N_1$, $N_2$, and s are held constant as $s_1$ is varied. The second derivative $\partial^2/\partial s_1^2$ of 
Equation (11) is 

$$-4\left(\frac{1}{N_1} + \frac{1}{N_2}\right)$$

and is negative, so that the extremum is a maximum. Thus the most probable configuration 
of the combined system is that for which (12) is satisfied: 

$$\frac{s_1}{N_1} = \frac{s_2}{N_2} = \frac{s}{N} . \tag{13}$$

* The product function of two Gaussian functions is always a Gaussian. 


The two systems are in equilibrium with respect to interchange of energy when the fractional 
spin excess of system 1 is equal to the fractional spin excess of system 2. 
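The condition (13) was derived from the Gaussian form (10). As a rough check, one can locate the largest term of the sum (6) by direct search, using exact binomial multiplicities instead. The sketch below is illustrative; `most_probable_s1` and the chosen sizes $N_1 = 100$, $N_2 = 200$, s = 30 are our own assumptions.

```python
from math import comb

def most_probable_s1(n1: int, n2: int, s: int) -> int:
    """Value of s1 that maximizes g1(N1, s1) g2(N2, s - s1), with exact multiplicities."""
    def term(s1: int) -> int:
        k1, k2 = n1 // 2 + s1, n2 // 2 + (s - s1)      # up spins in each system
        if not (0 <= k1 <= n1 and 0 <= k2 <= n2):
            return 0
        return comb(n1, k1) * comb(n2, k2)
    return max(range(-n1 // 2, n1 // 2 + 1), key=term)

n1, n2, s = 100, 200, 30
s1_hat = most_probable_s1(n1, n2, s)
# Fractional spin excess of system 1, of system 2, and of the combined system.
print(s1_hat, s1_hat / n1, (s - s1_hat) / n2, s / (n1 + n2))
```

For these sizes the three ratios coincide, as (13) requires; for sizes where $sN_1/N$ is not an integer the maximum falls on a neighboring integer value of $s_1$.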

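We prove that nearly all the accessible states of the combined systems satisfy or very 
nearly satisfy (13). If $\hat{s}_1$ and $\hat{s}_2$ denote the values of $s_1$ and $s_2$ at the maximum, then (13) 
is written as 

$$\frac{\hat{s}_1}{N_1} = \frac{\hat{s}_2}{N_2} = \frac{s}{N} . \tag{14}$$

To find the number of states in the most probable configuration, we insert (14) in (9) to 
obtain 

$$(g_1g_2)_{\max} \equiv g_1(\hat{s}_1)\, g_2(s - \hat{s}_1) = g_1(0)\, g_2(0) \exp(-2s^2/N) . \tag{15}$$

To investigate the sharpness of the maximum of $g_1g_2$ at a given value of s, introduce $\delta$ 
such that 

$$s_1 = \hat{s}_1 + \delta; \qquad s_2 = \hat{s}_2 - \delta . \tag{16}$$

Here $\delta$ measures the deviation of $s_1$, $s_2$ from their values $\hat{s}_1$, $\hat{s}_2$ at the maximum of $g_1g_2$. 
Square $s_1$, $s_2$ to form 

$$s_1^2 = \hat{s}_1^2 + 2\hat{s}_1\delta + \delta^2; \qquad s_2^2 = \hat{s}_2^2 - 2\hat{s}_2\delta + \delta^2 ,$$

which we substitute in (9) and (15) to obtain the number of states 

$$g_1(N_1,s_1)\, g_2(N_2,s_2) = (g_1g_2)_{\max} \exp\left(-\frac{4\hat{s}_1\delta}{N_1} - \frac{2\delta^2}{N_1} + \frac{4\hat{s}_2\delta}{N_2} - \frac{2\delta^2}{N_2}\right) .$$

We know from (14) that $\hat{s}_1/N_1 = \hat{s}_2/N_2$, so that the number of states in a configuration of 
deviation $\delta$ from equilibrium is 

$$g_1(N_1, \hat{s}_1 + \delta)\, g_2(N_2, \hat{s}_2 - \delta) = (g_1g_2)_{\max} \exp\left(-\frac{2\delta^2}{N_1} - \frac{2\delta^2}{N_2}\right) . \tag{17}$$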


As a numerical example in which the fractional deviation from equilibrium is very small, 
let $N_1 = N_2 = 10^{22}$ and $\delta = 10^{12}$; that is, $\delta/N_1 = 10^{-10}$. Then $2\delta^2/N_1 = 200$, and the 
product $g_1g_2$ is reduced to $e^{-400} \approx 10^{-174}$ of its maximum value. This is an extremely 
large reduction, so that $g_1g_2$ is truly a very sharply peaked function of $s_1$. The probability 
that the fractional deviation will be $10^{-10}$ or larger is found by integrating (17) from 
$\delta = 10^{12}$ out to a value of the order of s or of N, thereby including the area under the wings 
of the probability distribution. This is the subject of Problem 6. An upper limit to the 
integrated probability is given by $N \times 10^{-174} = 10^{-152}$, still very small. When two 
systems are in thermal contact, the values of $s_1$, $s_2$ that occur most often will be very close to 
the values of $\hat{s}_1$, $\hat{s}_2$ for which the product $g_1g_2$ is a maximum. It is extremely rare to find 
systems with values of $s_1$, $s_2$ perceptibly different from $\hat{s}_1$, $\hat{s}_2$. 

What does it mean to say that the probability of finding the system with a fractional 
deviation larger than $\delta/N_1 = 10^{-10}$ is only $10^{-152}$ of the probability of finding the system 
in equilibrium? We mean that the system will never be found with a deviation as much as 
1 part in $10^{10}$, small as this deviation seems. We would have to sample $10^{152}$ similar systems 
to have a reasonable chance of success in such an experiment. If we sample one system every 
$10^{-12}$ s, which is pretty fast work, we would have to sample for $10^{140}$ s. The age of the 
universe is only $10^{18}$ s. Therefore we say with great surety that the deviation described will 
never be observed. The estimate is rough, but the message is correct. The quotation from 
Boltzmann given at the beginning of this chapter is relevant here. 
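The orders of magnitude in this example are easy to reproduce if one works with base-10 logarithms throughout, since the numbers themselves are far too small for ordinary floating-point arithmetic. The following sketch assumes the values quoted in the text ($N_1 = N_2 = 10^{22}$, $\delta = 10^{12}$, one sample every $10^{-12}$ s); the variable names are our own.

```python
import math

n1 = n2 = 1e22          # spins in each system
delta = 1e12            # deviation of s1 from its most probable value

exponent = 2 * delta**2 / n1 + 2 * delta**2 / n2        # argument of exp(-...) in (17)
log10_reduction = -exponent / math.log(10)              # log10 of g1 g2 / (g1 g2)_max
log10_upper_bound = math.log10(n1) + log10_reduction    # N times the reduction, N ~ 10^22
log10_seconds = -log10_upper_bound + math.log10(1e-12)  # sampling time at 10^-12 s each

print(exponent, log10_reduction, log10_upper_bound, log10_seconds)
```

The printed exponents round to the powers of ten used in the discussion, which treats these quantities only to the nearest order of magnitude.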

We may expect to observe substantial fractional deviations only in the properties of a 
small system in thermal contact with a large system or reservoir. The energy of a small 
system, say a system of 10 spins, in thermal contact with a large reservoir may undergo 
fluctuations that are large in a fractional sense, as have been observed in experiments on the 
Brownian motion of small particles in suspension in liquids. The average energy of a small 
system in contact with a large system can always be determined accurately by observations 
at one time on a large number of identical small systems or by observations on one small 
system over a long period of time. 


THERMAL EQUILIBRIUM 


The result for the number of accessible states of two model spin systems in 
thermal contact may be generalized to any two systems in thermal contact, with 
constant total energy $U = U_1 + U_2$. By direct extension of the earlier argument, 
the multiplicity g(N,U) of the combined system is: 

$$g(N,U) = \sum_{U_1} g_1(N_1,U_1)\, g_2(N_2, U - U_1) , \tag{18}$$

summed over all values of $U_1 \le U$. Here $g_1(N_1,U_1)$ is the number of accessible 
states of system 1 at energy $U_1$. A configuration of the combined system is 
specified by the value of $U_1$, together with the constants U, $N_1$, $N_2$. The number 
of accessible states in a configuration is the product $g_1(N_1,U_1)\, g_2(N_2, U - U_1)$. 
The sum over all configurations gives g(N,U). 

The largest term in the sum in (18) governs the properties of the total system 
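in thermal equilibrium. For an extremum it is necessary that the differential* of 
g(N,U) be zero for an infinitesimal exchange of energy: 

$$dg = \left(\frac{\partial g_1}{\partial U_1}\right)_{N_1} g_2\, dU_1 + g_1 \left(\frac{\partial g_2}{\partial U_2}\right)_{N_2} dU_2 = 0; \qquad dU_1 + dU_2 = 0 . \tag{19}$$

We divide by $g_1g_2$ and use the result $dU_2 = -dU_1$ to obtain the thermal 
equilibrium condition: 

$$\frac{1}{g_1}\left(\frac{\partial g_1}{\partial U_1}\right)_{N_1} = \frac{1}{g_2}\left(\frac{\partial g_2}{\partial U_2}\right)_{N_2} , \tag{20a}$$

which we may write as 

$$\left(\frac{\partial \log g_1}{\partial U_1}\right)_{N_1} = \left(\frac{\partial \log g_2}{\partial U_2}\right)_{N_2} . \tag{20b}$$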


We define the quantity $\sigma$, called the entropy, by 

$$\sigma(N,U) \equiv \log g(N,U) , \tag{21}$$

where $\sigma$ is the Greek letter sigma. We now write (20) in the final form 


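$$\left(\frac{\partial \sigma_1}{\partial U_1}\right)_{N_1} = \left(\frac{\partial \sigma_2}{\partial U_2}\right)_{N_2} . \tag{22}$$

This is the condition for thermal equilibrium for two systems in thermal 
contact. Here $N_1$ and $N_2$ may symbolize not only the numbers of particles, but 
all constraints on the systems. 

* The notation $(\partial g_1/\partial U_1)_{N_1}$ 
means that $N_1$ is held constant in the differentiation of $g_1(N_1,U_1)$ with respect to $U_1$. That is, the 
partial derivative with respect to $U_1$ is defined as 

$$\left(\frac{\partial g_1}{\partial U_1}\right)_{N_1} = \lim_{\Delta U_1 \to 0} \frac{g_1(N_1, U_1 + \Delta U_1) - g_1(N_1, U_1)}{\Delta U_1} .$$

For example, if $g(x,y) = 3x^4y$, then $(\partial g/\partial x)_y = 12x^3y$ and $(\partial g/\partial y)_x = 3x^4$. 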
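For the model spin systems, the equilibrium condition (22) can be examined numerically. The sketch below is only an illustration, and it rests on assumptions that are not in the text: the factorials in the multiplicity are continued to non-integer arguments with the log-gamma function so that a derivative can be formed, the derivative is taken by a central difference with step `h`, and energies are measured in units with mB = 1.

```python
import math

def sigma(n: int, s: float) -> float:
    """Entropy sigma = log g for n spins, with factorials continued by log-gamma."""
    return (math.lgamma(n + 1) - math.lgamma(n / 2 + s + 1)
            - math.lgamma(n / 2 - s + 1))

def dsigma_dU(n: int, s: float, m_b: float = 1.0, h: float = 1e-3) -> float:
    """(d sigma / dU)_N by central difference, using U = -2 s m B, so dU = -2 m B ds."""
    dsigma_ds = (sigma(n, s + h) - sigma(n, s - h)) / (2 * h)
    return dsigma_ds / (-2 * m_b)

n1, n2, s = 100, 200, 30
s1_hat = s * n1 / (n1 + n2)                    # equal fractional spin excess
# At the most probable division the two derivatives nearly agree.
print(dsigma_dU(n1, s1_hat), dsigma_dU(n2, s - s1_hat))
# For an unequal division (all of the spin excess in system 2) they do not.
print(dsigma_dU(n1, 0.0), dsigma_dU(n2, float(s)))
```

The small residual difference in the first line reflects the finite size of the two systems; it shrinks as $N_1$ and $N_2$ grow.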


TEMPERATURE 


The last equality (22) leads us immediately to the concept of temperature. We 
know the everyday rule: in thermal equilibrium the temperatures of the two 
systems are equal: 


$$T_1 = T_2 . \tag{23}$$

This rule must be equivalent to (22), so that T must be a function of $(\partial\sigma/\partial U)_N$. 
If T denotes the absolute temperature in kelvin, this function is simply the 
inverse relationship 

$$\frac{1}{T} = k_B \left(\frac{\partial \sigma}{\partial U}\right)_N . \tag{24}$$

The proportionality constant $k_B$ is a universal constant called the Boltzmann 
constant. As determined experimentally, 

$$k_B = 1.381 \times 10^{-23}\ \text{joules/kelvin} = 1.381 \times 10^{-16}\ \text{ergs/kelvin} . \tag{25}$$

We defer the discussion to Appendix B because we prefer to use a more natural 
temperature scale: we define the fundamental temperature $\tau$ by 

$$\frac{1}{\tau} = \left(\frac{\partial \sigma}{\partial U}\right)_N . \tag{26}$$

This temperature differs from the Kelvin temperature by the scale factor $k_B$: 

$$\tau = k_B T . \tag{27}$$


Because $\sigma$ is a pure number, the fundamental temperature $\tau$ has the dimensions
of energy. We can use as a temperature scale the energy scale, in whatever unit
may be employed for the latter, joule or erg. This procedure is much simpler
than the introduction of the Kelvin scale in which the unit of temperature is
arbitrarily selected so that the triple point of water is exactly 273.16 K. The triple
point of water is the unique temperature at which water, ice, and water vapor
coexist.
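
As a small illustration of the scale factor in (27), the following sketch converts a Kelvin temperature to the fundamental temperature in joules and in ergs. The function name `fundamental_temperature` is a hypothetical helper introduced here, and the constants are the rounded values quoted in (25).

```python
# Sketch: tau = k_B * T, using the rounded constants quoted in (25).
K_B_J_PER_K = 1.381e-23     # Boltzmann constant in joules/kelvin
K_B_ERG_PER_K = 1.381e-16   # the same constant in ergs/kelvin

def fundamental_temperature(T_kelvin, k_B=K_B_J_PER_K):
    """Return tau = k_B * T in the energy unit implied by k_B (hypothetical helper)."""
    return k_B * T_kelvin

# Fundamental temperature at the triple point of water, 273.16 K.
tau_triple_J = fundamental_temperature(273.16)
tau_triple_erg = fundamental_temperature(273.16, K_B_ERG_PER_K)
print(f"tau(273.16 K) = {tau_triple_J:.3e} J = {tau_triple_erg:.3e} erg")
```

The two results differ only by the factor $10^7$ between the joule and the erg; the physics is carried entirely by the choice of energy unit.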

Historically, the conventional scale dates from an age in which it was possible
to build accurate thermometers even though the relation of temperature to
quantum states was as yet not understood. Even at present, it is still possible to
measure temperatures with thermometers calibrated in kelvin to a higher
precision than the accuracy with which the conversion factor $k_B$ itself is known,
about 32 parts per million. Questions of practical thermometry are discussed in
Appendix B.


Comment. In (26) we defined the reciprocal of $\tau$ as the partial derivative $(\partial\sigma/\partial U)_N$. It is
permissible to take the reciprocal of both sides to write

$$\tau = (\partial U/\partial \sigma)_N. \tag{28}$$

The two expressions (26) and (28) have a slightly different meaning. In (26), the entropy $\sigma$
was given as a function of the independent variables $U$ and $N$ as $\sigma = \sigma(U,N)$. Hence $\tau$
determined from (26) has the same independent variables, $\tau = \tau(U,N)$. In (28), however,
differentiation of $U$ with respect to $\sigma$ with $N$ constant implies $U = U(\sigma,N)$, so that
$\tau = \tau(\sigma,N)$. The definition of temperature is the same in both cases, but it is expressed as a
function of different independent variables. The question “What are the independent
variables?” arises frequently in thermal physics because in some experiments we control
some variables, and in other experiments we control other variables.


ENTROPY 
The quantity $\sigma \equiv \log g$ was introduced in (21) as the entropy of the system: the
entropy is defined as the logarithm of the number of states accessible to the
system. As defined, the entropy is a pure number. In classical thermodynamics
the entropy $S$ is defined by

$$\frac{1}{T} \equiv \left(\frac{\partial S}{\partial U}\right)_N. \tag{29}$$


Figure 2.7 If the temperature $\tau_1$ is higher than $\tau_2$, the transfer of a positive
amount of energy $\delta U$ from system 1 to system 2 will increase the total entropy
$\sigma_1 + \sigma_2$ of the combined systems over the initial value
$\sigma_1(\text{initial}) + \sigma_2(\text{initial})$. In other words, the final system will be in a more
probable condition if energy flows from the warmer body to the cooler body when
thermal contact is established. This is an example of the law of increasing entropy,
$\sigma_1(\text{final}) + \sigma_2(\text{final}) > \sigma_1(\text{initial}) + \sigma_2(\text{initial})$.


As a consequence of (24), we see that $S$ and $\sigma$ are connected by a scale factor:

$$S = k_B \sigma. \tag{30}$$


We will call S the conventional entropy. 

The more states that are accessible, the greater the entropy. In the definition
of $\sigma(N,U)$ we have indicated a functional dependence of the entropy on the
number of particles in the system and on the energy of the system. The entropy
may depend on additional independent variables: the entropy of a gas (Chapter
3) depends on the volume.

In the early history of thermal physics the physical significance of the entropy 
was not known. Thus the author of the article on thermodynamics in the 
Encyclopaedia Britannica, 11th ed. (1905), wrote: “The utility of the conception 
of entropy . . . is limited by the fact that it does not correspond directly to any 
directly measurable physical property, but is merely a mathematical function 
of the definition of absolute temperature.” We now know what absolute physical 
property the entropy measures. An example of the comparison of the experi- 
mental determination and theoretical calculation of the entropy is discussed in 
Chapter 6. 

Consider the total entropy change $\Delta\sigma$ when we remove a positive amount of
energy $\Delta U$ from 1 and add the same amount of energy to 2, as in Figure 2.7.
The total entropy change is

$$\Delta\sigma = \left(\frac{\partial \sigma_1}{\partial U_1}\right)_{N_1}(-\Delta U) + \left(\frac{\partial \sigma_2}{\partial U_2}\right)_{N_2}(\Delta U) = \left(-\frac{1}{\tau_1} + \frac{1}{\tau_2}\right)\Delta U. \tag{31}$$

When $\tau_1 > \tau_2$ the quantity in parentheses on the right-hand side is positive,
so that the total change of entropy is positive when the direction of energy flow
is from the system with the higher temperature to the system with the lower
temperature.


Example: Entropy increase on heat flow. This example makes use of the reader's previous
familiarity with heat and specific heat.

(a) Let a 10-g specimen of copper at a temperature of 350 K be placed in thermal contact
with an identical specimen at a temperature of 290 K. Let us find the quantity of energy
transferred when the two specimens are placed in contact and come to equilibrium at the
final temperature $T_f$. The specific heat of metallic copper over the temperature range 15°C
to 100°C is approximately 0.389 J g$^{-1}$ K$^{-1}$, according to a standard handbook.

The energy increase of the second specimen is equal to the energy loss of the first; thus
the energy increase of the second specimen is, in joules,

$$\Delta U = (3.89\ \mathrm{J\,K^{-1}})(T_f - 290\ \mathrm{K}) = (3.89\ \mathrm{J\,K^{-1}})(350\ \mathrm{K} - T_f),$$

where the temperatures are in kelvin. The final temperature after contact is

$$T_f = \tfrac{1}{2}(350 + 290)\ \mathrm{K} = 320\ \mathrm{K}.$$

Thus

$$\Delta U_1 = (3.89\ \mathrm{J\,K^{-1}})(-30\ \mathrm{K}) = -117\ \mathrm{J},$$

and

$$\Delta U_2 = -\Delta U_1 = 117\ \mathrm{J}.$$


(b) What is the change of entropy of the two specimens when a transfer of 0.1 J has
taken place, almost immediately after initial contact? Notice that this transfer is a small
fraction of the final energy transfer as calculated above. Because the energy transfer
considered is small, we may suppose the specimens are approximately at their initial
temperatures of 350 and 290 K. The entropy of the first body is decreased by

$$\Delta S_1 = \frac{-0.1\ \mathrm{J}}{350\ \mathrm{K}} = -2.86 \times 10^{-4}\ \mathrm{J\,K^{-1}}.$$

The entropy of the second body is increased by

$$\Delta S_2 = \frac{0.1\ \mathrm{J}}{290\ \mathrm{K}} = 3.45 \times 10^{-4}\ \mathrm{J\,K^{-1}}.$$

The total entropy increases by

$$\Delta S_1 + \Delta S_2 = (-2.86 + 3.45) \times 10^{-4}\ \mathrm{J\,K^{-1}} = 0.59 \times 10^{-4}\ \mathrm{J\,K^{-1}}.$$

In fundamental units the increase of entropy is

$$\Delta\sigma = \frac{\Delta S}{k_B} = \frac{0.59 \times 10^{-4}\ \mathrm{J\,K^{-1}}}{1.38 \times 10^{-23}\ \mathrm{J\,K^{-1}}} = 0.43 \times 10^{19}, \tag{32}$$

where $k_B$ is the Boltzmann constant. This result means that the number of accessible states
of the two systems increases by the factor $\exp(\Delta\sigma) = \exp(0.43 \times 10^{19})$.
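
The arithmetic of this example can be checked with a short script. This is only a sketch under the assumptions stated in the example (identical 10-g specimens, a constant specific heat of 0.389 J g$^{-1}$ K$^{-1}$, and specimens still at their initial temperatures during the 0.1 J transfer); the variable names are introduced here for illustration. Note that the product 3.89 J/K × 30 K evaluates to about 117 J.

```python
# Sketch of the copper example; all variable names are illustrative.
K_B = 1.381e-23            # Boltzmann constant, J/K, as quoted in (25)

mass_g = 10.0              # mass of each specimen, g
c_specific = 0.389         # specific heat of copper, J g^-1 K^-1
C = mass_g * c_specific    # heat capacity of each specimen, 3.89 J/K
T1, T2 = 350.0, 290.0      # initial temperatures, K

# (a) Identical specimens: the final temperature is the mean of the two.
T_f = 0.5 * (T1 + T2)
dU1 = C * (T_f - T1)       # energy change of the warmer specimen (negative)
dU2 = -dU1                 # energy gained by the cooler specimen

# (b) Entropy changes for a small transfer dU at the initial temperatures.
dU = 0.1                   # J
dS1 = -dU / T1             # warmer body loses entropy
dS2 = dU / T2              # cooler body gains more entropy
dS_total = dS1 + dS2
d_sigma = dS_total / K_B   # entropy increase in fundamental units

print(f"T_f = {T_f:.0f} K, dU2 = {dU2:.1f} J")
print(f"dS1 = {dS1:.3e} J/K, dS2 = {dS2:.3e} J/K, total = {dS_total:.3e} J/K")
print(f"d_sigma = {d_sigma:.3e}")
```

The entropy increase is positive because the same energy is divided by a smaller temperature on the receiving side, which is the content of (31).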


Law of Increase of Entropy 


We can show that the total entropy always increases when two systems are 
brought into thermal contact. We have just demonstrated this in a special case. 
If the total energy $U = U_1 + U_2$ is constant, the total multiplicity after the
systems are in thermal contact is

$$g(U) = \sum_{U_1} g_1(U_1)\, g_2(U - U_1), \tag{33}$$

by (18). This expression contains the term $g_1(U_{10})\,g_2(U - U_{10})$ for the initial
multiplicity before contact and many other terms besides. Here $U_{10}$ is the
initial energy of system 1 and $U - U_{10}$ is the initial energy of system 2. Because
all terms in (33) are positive numbers, the multiplicity is always increased by
establishment of thermal contact between two systems. This is a proof of the
law of increase of entropy for a well-defined operation.
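
The counting argument can be made concrete with two small spin systems. The sketch below assumes the model spin system multiplicity $g(N,s) = N!/[(\tfrac12 N + s)!\,(\tfrac12 N - s)!]$ and uses hypothetical sizes $N_1 = 20$, $N_2 = 30$ with system 1 prepared fully aligned; the total spin excess plays the role of the conserved total energy. It compares the initial term with the full sum of (33) and with the largest single term.

```python
from math import comb, log

def multiplicity(N, s):
    """States of N spins with spin excess 2s (N even, |s| <= N/2)."""
    return comb(N, N // 2 + s)

# Hypothetical small systems; system 1 starts with all spins aligned.
N1, N2 = 20, 30
s1_initial, s2_initial = 10, -5
s_total = s1_initial + s2_initial      # conserved after contact

# One term g1(s1) * g2(s - s1) for every allowed configuration.
terms = {}
for s1 in range(-(N1 // 2), N1 // 2 + 1):
    s2 = s_total - s1
    if abs(s2) <= N2 // 2:
        terms[s1] = multiplicity(N1, s1) * multiplicity(N2, s2)

g_initial = terms[s1_initial]          # multiplicity before contact
g_total = sum(terms.values())          # multiplicity after contact
s1_hat = max(terms, key=terms.get)     # most probable configuration
g_max = terms[s1_hat]

print(f"initial term : {g_initial:.3e}, sigma = {log(g_initial):.2f}")
print(f"largest term : {g_max:.3e} at s1 = {s1_hat}, sigma = {log(g_max):.2f}")
print(f"full sum     : {g_total:.3e}, sigma = {log(g_total):.2f}")
```

Even for these tiny systems the largest term exceeds the initial term by several orders of magnitude, while the logarithm of the full sum differs little from the logarithm of the largest term.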

The significant effect of contact, the effect that stands out even after taking the
logarithm of the multiplicity, is not just that the number of terms in the summation
is large, but that the largest single term in the summation may be very, very
much larger than the initial multiplicity. That is,

$$(g_1 g_2)_{\max} \equiv g_1(\hat{U}_1)\, g_2(U - \hat{U}_1) \tag{34}$$

may be very, very much larger than the initial term

$$g_1(U_{10})\, g_2(U - U_{10}). \tag{35}$$

Here $\hat{U}_1$ denotes the value of $U_1$ for which the product $g_1 g_2$ is a maximum.
The essential effect is that the systems after contact evolve from their initial
configurations to their final configurations. The fundamental assumption
implies that evolution in this operation will always take place, with all accessible
final states equally probable.

Figure 2.8 A system with two parts, 1 and 2, is prepared at zero time
with $U_1 = 0$ and $U_2 = U$. Exchange of energy takes place between two
parts and presently the system will be found in or close to the most
probable configuration. The entropy increases as the system attains
configurations of increasing multiplicity or probability. The entropy
eventually reaches the entropy $\sigma(U)$ of the most probable configuration.

The statement

$$\sigma_{\text{final}} \simeq \log(g_1 g_2)_{\max} \geq \sigma_{\text{initial}} = \log(g_1 g_2)_0 \tag{36}$$

is a statement of the law of increase of entropy: the entropy of a closed system
tends to remain constant or to increase when a constraint internal to the system
is removed. The operation of establishing thermal contact is equivalent to the
removal of the constraint that $U_1$, $U_2$ each be constant; after contact only
$U_1 + U_2$ need be constant.

The evolution of the combined system towards the final thermal equilibrium
configuration takes a certain time. If we separate the two systems before they
reach this configuration, we will obtain an intermediate configuration with
intermediate energies and an intermediate entropy. It is therefore meaningful to
view the entropy as a function of the time that has elapsed since removal of the
constraint, called the time of evolution in Figure 2.8.

Processes that tend to increase the entropy of a system are shown in Figure
2.9; the arguments in support of each process will be developed in the chapters
that follow.

Figure 2.9 Operations that tend to increase the entropy of a system.
For a large system* (in thermal contact with another large system) there will
never occur spontaneously significant differences between the actual value of the 
entropy and the value of the entropy of the most probable configuration of the 
system. We showed this for the model spin system in the argument following (17); 
we used “never” in the sense of not once in the entire age of the universe, $10^{18}$ s.
We can only find a significant difference between the actual entropy and the 
entropy of the most probable configuration of the macroscopic system very 
shortly after we have changed the nature of the contact between two systems, 
which implies that we had prepared the system initially in some special way. 
Special preparation could consist of lining up all the spins in one system parallel 
to one another or collecting all the molecules in the air of the room into the 
system formed by a small volume in one corner of the room. Such extreme 
situations never arise naturally in systems left undisturbed, but arise from 
artificial operations performed on the system. 

Consider the gas in a room: the gas in one half of the room might be prepared 
initially with a low value of the average energy per molecule, while the gas in the 
other half of the room might be prepared initially with a higher value of the 
average energy per molecule. If the gas in the two halves is now allowed to 
interact by removal of a partition, the gas molecules will come very quickly†
to a most probable configuration in which the molecules in both halves of the 
room have the same average energy. Nothing else will ever be observed to 
happen. We will never observe the system to leave the most probable configura- 
tion and reappear later in the initial specially prepared configuration. This is true 
even though the equations of motion of physics are reversible in time and do not 
distinguish past and future. 


LAWS OF THERMODYNAMICS 


When thermodynamics is studied as a nonstatistical subject, four postulates 
are introduced. These postulates are called the laws of thermodynamics. In 
essence, these laws are contained within our statistical formulation of thermal 
physics, but it is useful to exhibit them as separate statements. 

Zeroth law. If two systems are in thermal equilibrium with a third system,
they must be in thermal equilibrium with each other. This law is a consequence
of the condition (20b) for equilibrium in thermal contact:

$$\left(\frac{\partial \log g_1}{\partial U_1}\right)_{N_1} = \left(\frac{\partial \log g_3}{\partial U_3}\right)_{N_3}; \qquad \left(\frac{\partial \log g_2}{\partial U_2}\right)_{N_2} = \left(\frac{\partial \log g_3}{\partial U_3}\right)_{N_3}.$$

In other words, $\tau_1 = \tau_3$ and $\tau_2 = \tau_3$ imply $\tau_1 = \tau_2$.

* A large or macroscopic system may be taken to be one with more than $10^{10}$ or $10^{15}$ atoms.
† The calculation of the time required for the process is largely a problem in hydrodynamics.

First law. Heat is a form of energy. This law is no more than a statement of 
the principle of conservation of energy. Chapter 8 discusses what form of energy 
heat is. 

Second law. There are many equivalent statements of the second law. We 
shall use the statistical statement, which we have called the law of increase of 
entropy, applicable when a constraint internal to a closed system is removed. The 
commonly used statement of the law of increase of entropy is: “If a closed system
is in a configuration that is not the equilibrium configuration, the most probable 
consequence will be that the entropy of the system will increase monotonically 
in successive instants of time.” This is a looser statement than the one we gave 
with Eq. (36) above. 

The traditional thermodynamic statement is the Kelvin-Planck formulation
of the second law of thermodynamics: “It is impossible for any cyclic process to
occur whose sole effect is the extraction of heat from a reservoir and the per- 
formance of an equivalent amount of work.” An engine that violates the second 
law by extracting the energy of one heat reservoir is said to be performing 
perpetual motion of the second kind. We will see in Chapter 8 that the Kelvin- 
Planck formulation is a consequence of the statistical statement. 

Third law. The entropy of a system approaches a constant value as the 
temperature approaches zero. The earliest statement of this law, due to Nernst, is 
that at the absolute zero the entropy difference disappears between all those 
configurations of a system which are in internal thermal equilibrium. The third 
law follows from the statistical definition of the entropy, provided that the 
ground state of the system has a well-defined multiplicity. If the ground state
multiplicity is $g(0)$, the corresponding entropy is $\sigma(0) = \log g(0)$ as $\tau \to 0$.
From a quantum point of view, the law does not appear to say much that is
not implicit in the definition of entropy, provided, however, that the system is
in its lowest set of quantum states at absolute zero. Except for glasses, there
would not be any objection to affirming that $g(0)$ is a small number and $\sigma(0)$
is essentially zero. Glasses have a frozen-in disorder, and for them $\sigma(0)$ can be
substantial, of the order of the number of atoms $N$. What the third law tells us
in real life is that curves of many reasonable physical quantities plotted against $\tau$
must come in flat as $\tau$ approaches 0.
Entropy as a Logarithm


Several useful properties follow from the definition of the entropy as the loga- 
rithm of the number of accessible states, instead of as the number of accessible 
states itself. First, the entropy of two independent systems is the sum of the 
separate entropies. 

Second, the entropy is entirely insensitive, for all practical purposes, to
the precision $\delta U$ with which the energy of a closed system is defined. We have
never meant to imply that the system energy is known exactly, a circumstance
that for a discrete spectrum of energy eigenvalues would make the number of
accessible states depend erratically on the energy. We have simply not paid
much attention to the precision, whether it be determined by the uncertainty
principle $\delta U \times (\text{time}) \sim \hbar$, or determined otherwise. Define $\mathcal{D}(U)$ as the number
of accessible states per unit energy range; $\mathcal{D}(U)$ can be a suitable smoothed
average centered at $U$. Then $g(U) = \mathcal{D}(U)\,\delta U$ is the number of accessible
states in the range $\delta U$ at $U$. The entropy is

$$\sigma(U) = \log \mathcal{D}(U)\,\delta U = \log \mathcal{D}(U) + \log \delta U. \tag{37}$$

Typically, as for the system of $N$ spins, the total number of states will be of the
order of $2^N$. If the total energy is of the order of $N$ times some average one-particle
energy $\Delta$, then $\mathcal{D}(U) \sim 2^N/N\Delta$. Thus

$$\sigma(U) = N \log 2 - \log N\Delta + \log \delta U. \tag{38}$$

Let $N = 10^{20}$; $\Delta = 10^{-14}$ erg; and $\delta U = 10^{-1}$ erg. Then

$$\sigma(U) = 0.69 \times 10^{20} - 13.82 - 2.3. \tag{39}$$

We see from this example that the value of the entropy is dominated overwhelmingly
by the value of $N$; the precision $\delta U$ is without perceptible effect on the
result. In the problem of $N$ free particles in a box, the number of states is proportional
to something like $U^N\,\delta U$, whence $\sigma \sim N \log U + \log \delta U$. Again the
term in $N$ is dominant, a conclusion independent of even the system of units
used for the energy.
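
The three terms of (38) can be evaluated directly for the values quoted above. The sketch below also changes the precision $\delta U$ by a factor of a million (an arbitrary choice made here for illustration) to show how little the entropy responds.

```python
from math import log

# Values quoted in the text for (39).
N = 1e20          # number of spins
Delta = 1e-14     # average one-particle energy, erg
dU = 1e-1         # precision of the energy, erg

term_N = N * log(2)               # N log 2
term_density = -log(N * Delta)    # -log(N * Delta)
term_precision = log(dU)          # log(delta U)
sigma = term_N + term_density + term_precision

# Illustrative change: make the precision a million times finer.
shift = log(dU * 1e-6) - log(dU)

print(f"N log 2       = {term_N:.3e}")
print(f"-log(N Delta) = {term_density:.2f}")
print(f"log(dU)       = {term_precision:.2f}")
print(f"relative change in sigma for a 1e-6 finer dU: {shift / sigma:.1e}")
```

A change of six orders of magnitude in $\delta U$ shifts the entropy by about 14 out of roughly $7 \times 10^{19}$, far below any measurable level.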


Example: Perpetual motion of the second kind. Early in our study of physics we came to 
understand the impossibility of a perpetual motion machine, a machine that will give forth 
more energy than it absorbs. 


Equally impossible is a perpetual motion machine of the second kind, as it is called, in 
which heat is extracted from part of the environment and delivered to another part of the 
environment, the difference in temperature thus established being used to power a heat 
engine that delivers mechanical work available for any purpose at no cost to us. In brief, we 
cannot propel a ship by cooling the surrounding ocean to extract the energy necessary to 
propel the ship. The spontaneous transfer of energy from the low temperature ocean to a 
higher temperature boiler on the ship would decrease the total entropy of the combined 
systems and would thus be in violation of the law of increase of entropy. 


SUMMARY 

1. The fundamental assumption is that a closed system is equally likely to be 
in any of the quantum states accessible to it. 

2. If $P(s)$ is the probability that a system is in the state $s$, the average value of a
quantity $X$ is

$$\langle X \rangle = \sum_s X(s)\,P(s).$$

3. An ensemble of systems is composed of very many systems, all constructed
alike.

4. The number of accessible states of the combined systems 1 and 2 is

$$g(s) = \sum_{s_1} g_1(s_1)\, g_2(s - s_1),$$

where $s_1 + s_2 = s$.

5. The entropy $\sigma(N,U) \equiv \log g(N,U)$. The relation $S = k_B\sigma$ connects the
conventional entropy $S$ with the fundamental entropy $\sigma$.

6. The fundamental temperature $\tau$ is defined by

$$1/\tau \equiv (\partial\sigma/\partial U)_{N,V}.$$

The relation $\tau = k_B T$ connects the fundamental temperature and the
conventional temperature.


7. The law of increase of entropy states that the entropy of a closed system 
tends to remain constant or to increase when a constraint internal to the 
system is removed. 
8. The thermal equilibrium values of the physical properties of a system are
defined as averages over all states accessible when the system is in contact 
with a large system or reservoir. If the first system also is large, the thermal 
equilibrium properties are given accurately by consideration of the states in 
the most probable configuration alone. 


PROBLEMS 


1. Entropy and temperature. Suppose $g(U) = CU^{3N/2}$, where $C$ is a constant
and $N$ is the number of particles. (a) Show that $U = \tfrac{3}{2}N\tau$. (b) Show that
$(\partial^2\sigma/\partial U^2)_N$ is negative. This form of $g(U)$ actually applies to an ideal gas.


2. Paramagnetism. Find the equilibrium value at temperature $\tau$ of the fractional
magnetization

$$M/Nm = 2\langle s \rangle/N$$

of the system of $N$ spins each of magnetic moment $m$ in a magnetic field $B$. The
spin excess is $2s$. Take the entropy as the logarithm of the multiplicity $g(N,s)$
as given in (1.35):

$$\sigma(s) \simeq \log g(N,0) - 2s^2/N, \tag{40}$$

for $|s| \ll N$. Hint: Show that in this approximation

$$\sigma(U) = \sigma_0 - U^2/2m^2B^2N, \tag{41}$$

with $\sigma_0 = \log g(N,0)$. Further, show that $1/\tau = -U/m^2B^2N$, where $U$ denotes
$\langle U \rangle$, the thermal average energy.


3. Quantum harmonic oscillator. (a) Find the entropy of a set of $N$ oscillators
of frequency $\omega$ as a function of the total quantum number $n$. Use the multiplicity
function (1.55) and make the Stirling approximation $\log N! \simeq N\log N - N$.
Replace $N - 1$ by $N$. (b) Let $U$ denote the total energy $n\hbar\omega$ of the oscillators.
Express the entropy as $\sigma(U,N)$. Show that the total energy at temperature $\tau$ is

$$U = \frac{N\hbar\omega}{\exp(\hbar\omega/\tau) - 1}. \tag{42}$$

This is the Planck result; it is derived again in Chapter 4 by a powerful method
that does not require us to find the multiplicity function.




4. The meaning of “never.” It has been said* that “six monkeys, set to strum
unintelligently on typewriters for millions of years, would be bound in time
to write all the books in the British Museum.” This statement is nonsense, for
it gives a misleading conclusion about very, very large numbers. Could all the
monkeys in the world have typed out a single specified book in the age of the
universe?†

Suppose that $10^{10}$ monkeys have been seated at typewriters throughout the
age of the universe, $10^{18}$ s. This number of monkeys is about three times greater
than the present human population‡ of the earth. We suppose that a monkey
can hit 10 typewriter keys per second. A typewriter may have 44 keys; we
accept lowercase letters in place of capital letters. Assuming that Shakespeare’s
Hamlet has $10^5$ characters, will the monkeys hit upon Hamlet?

(a) Show that the probability that any given sequence of $10^5$ characters
typed at random will come out in the correct sequence (the sequence of Hamlet)
is of the order of

$$\left(\frac{1}{44}\right)^{100\,000} = 10^{-164\,345},$$

where we have used $\log_{10} 44 = 1.64345$.

(b) Show that the probability that a monkey-Hamlet will be typed in the age
of the universe is approximately $10^{-164\,316}$. The probability of Hamlet is
therefore zero in any operational sense of an event, so that the original statement
at the beginning of this problem is nonsense: one book, much less a library,
will never occur in the total literary production of the monkeys.

* J. Jeans, Mysterious universe, Cambridge University Press, 1930, p. 4. The statement is attributed
to Huxley.

† For a related mathematico-literary study, see “The Library of Babel,” by the fascinating Argentine
writer Jorge Luis Borges, in Ficciones, Grove Press, Evergreen paperback, 1962, pp. 79-88.

‡ For every person now alive, some thirty persons have once lived. This figure is quoted by A. C.
Clarke in 2001. We are grateful to the Population Reference Bureau and to Dr. Roger Revelle for
explanations of the evidence. The cumulative number of man-seconds is $2 \times 10^{20}$, if we take the
average lifetime as $2 \times 10^9$ s and the number of lives as $1 \times 10^{11}$. The cumulative number of
man-seconds is much less than the number of monkey-seconds ($10^{28}$) taken in the problem.


5. Additivity of entropy for two spin systems. Given two systems of $N_1 \simeq N_2 = 10^{22}$
spins with multiplicity functions $g_1(N_1,s_1)$ and $g_2(N_2,s - s_1)$, the
product $g_1 g_2$ as a function of $s_1$ is relatively sharply peaked at $s_1 = \hat{s}_1$. For
$s_1 = \hat{s}_1 + 10^{12}$, the product $g_1 g_2$ is reduced by $10^{-174}$ from its peak value. Use the
Gaussian approximation to the multiplicity function; the form (17) may be
useful.

(a) Compute $g_1 g_2/(g_1 g_2)_{\max}$ for $s_1 = \hat{s}_1 + 10^{11}$ and $s = 0$.

(b) For $s = 10^{20}$, by what factor must you multiply $(g_1 g_2)_{\max}$ to make it
equal to $\sum_{s_1} g_1(N_1,s_1)\,g_2(N_2,s - s_1)$; give the factor to the nearest order of
magnitude.

(c) How large is the fractional error in the entropy when you ignore this
factor?


6. Integrated deviation. For the example that gave the result (17), calculate
approximately the probability that the fractional deviation from equilibrium
$\delta/N_1$ is $10^{-10}$ or larger. Take $N_1 = N_2 = 10^{22}$. You will find it convenient to
use an asymptotic expansion for the complementary error function. When
$x \gg 1$,

$$2x \exp(x^2) \int_x^\infty \exp(-t^2)\,dt \simeq 1 + \text{small terms}.$$
Units: Thermodynamic results can easily be translated from fundamental units to conventional
units. The only quantity that will cause difficulty is the heat capacity, defined below in (17a) as
$C(\text{fund.}) \equiv \tau(\partial\sigma/\partial\tau)$ in fundamental units and as $C(\text{conv.}) \equiv T(\partial S/\partial T)$ in conventional units. These
two quantities are not equal, for $C(\text{conv.}) = k_B\,C(\text{fund.})$.


Chapter 3: Boltzmann Distribution and Helmholtz Free Energy 


The laws of thermodynamics may easily be obtained from the principles of 
statistical mechanics, of which they are the incomplete expression. 


Gibbs 


We are able to distinguish in mechanical terms the thermal action of one system 
on another from that which we call mechanical in the narrower sense . . . So as 
to specify cases of thermal action and cases of mechanical action. 


Gibbs 
In this chapter we develop the principles that permit us to calculate the values
of the physical properties of a system as a function of the temperature. We
assume that the system $\mathcal{S}$ of interest to us is in thermal equilibrium with a very
large system $\mathcal{R}$, called the reservoir. The system and the reservoir will have a
common temperature $\tau$ because they are in thermal contact.

The total system $\mathcal{R} + \mathcal{S}$ is a closed system, insulated from all external
influences, as in Figure 3.1. The total energy $U_0 = U_{\mathcal{R}} + U_{\mathcal{S}}$ is constant. In
particular, if the system is in a state of energy $\varepsilon_s$, then $U_0 - \varepsilon_s$ is the energy of
the reservoir.


Figure 3.1 Representation of a closed total system decomposed into a
reservoir $\mathcal{R}$ in thermal contact with a system $\mathcal{S}$.


BOLTZMANN FACTOR 


A central problem of thermal physics is to find the probability that the system
$\mathcal{S}$ will be in a specific quantum state $s$ of energy $\varepsilon_s$. This probability is
proportional to the Boltzmann factor.

When we specify that $\mathcal{S}$ should be in the state $s$, the number of accessible
states of the total system is reduced to the number of accessible states of the
reservoir $\mathcal{R}$, at the appropriate energy. That is, the number $g_{\mathcal{R}+\mathcal{S}}$ of states
accessible to $\mathcal{R} + \mathcal{S}$ is

$$g_{\mathcal{R}+\mathcal{S}} = g_{\mathcal{R}} \times 1 = g_{\mathcal{R}}, \tag{1}$$

because for our present purposes we have specified the state of $\mathcal{S}$.

Figure 3.2 The change of entropy when the reservoir transfers energy $\varepsilon$ to the
system. The fractional effect of the transfer on the reservoir is small when the
reservoir is large, because a large reservoir will have a high entropy. (Plot of the
entropy of the reservoir against the energy of the reservoir, with $U_0 - \varepsilon$ and $U_0$
marked on the energy axis.)

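If the system energy is $\varepsilon_s$, the reservoir energy is $U_0 - \varepsilon_s$. The number of
states accessible to the reservoir in this condition is $g_{\mathcal{R}}(U_0 - \varepsilon_s)$, as in Figure 3.2.
The ratio of the probability that the system is in quantum state 1 at energy
$\varepsilon_1$ to the probability that the system is in quantum state 2 at energy $\varepsilon_2$ is the
ratio of the two multiplicities:

$$\frac{P(\varepsilon_1)}{P(\varepsilon_2)} = \frac{\text{Multiplicity of } \mathcal{R} \text{ at energy } U_0 - \varepsilon_1}{\text{Multiplicity of } \mathcal{R} \text{ at energy } U_0 - \varepsilon_2} = \frac{g_{\mathcal{R}}(U_0 - \varepsilon_1)}{g_{\mathcal{R}}(U_0 - \varepsilon_2)}. \tag{2}$$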


This result is a direct consequence of what we have called the fundamental 
assumption. The two situations are shown in Figure 3.3. Although questions 
about the system depend on the constitution of the reservoir, we shall see that 
the dependence is only on the temperature of the reservoir. 

If the reservoirs are very large, the multiplicities are very, very large numbers.
We write (2) in terms of the entropy of the reservoir:

$$\frac{P(\varepsilon_1)}{P(\varepsilon_2)} = \frac{\exp[\sigma_{\mathcal{R}}(U_0 - \varepsilon_1)]}{\exp[\sigma_{\mathcal{R}}(U_0 - \varepsilon_2)]} = \exp[\sigma_{\mathcal{R}}(U_0 - \varepsilon_1) - \sigma_{\mathcal{R}}(U_0 - \varepsilon_2)]. \tag{3}$$

With

$$\Delta\sigma_{\mathcal{R}} \equiv \sigma_{\mathcal{R}}(U_0 - \varepsilon_1) - \sigma_{\mathcal{R}}(U_0 - \varepsilon_2), \tag{4}$$

the probability ratio for the two states 1, 2 of the system is simply

$$\frac{P(\varepsilon_1)}{P(\varepsilon_2)} = \exp(\Delta\sigma_{\mathcal{R}}). \tag{5}$$

Figure 3.3 The system in (a), (b) is in quantum state 1, 2. The reservoir
has $g_{\mathcal{R}}(U_0 - \varepsilon_1)$, $g_{\mathcal{R}}(U_0 - \varepsilon_2)$ accessible quantum states, in (a) and (b)
respectively.


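Let us expand the entropies in (4) in a Taylor series expansion about $\sigma_{\mathcal{R}}(U_0)$.
The Taylor series expansion of $f(x)$ about $f(x_0)$ is

$$f(x_0 + a) = f(x_0) + a\left(\frac{df}{dx}\right)_{x = x_0} + \frac{1}{2!}a^2\left(\frac{d^2f}{dx^2}\right)_{x = x_0} + \cdots. \tag{6}$$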
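Thus

$$\sigma_{\mathcal{R}}(U_0 - \varepsilon) = \sigma_{\mathcal{R}}(U_0) - \varepsilon(\partial\sigma_{\mathcal{R}}/\partial U)_{V,N} + \cdots = \sigma_{\mathcal{R}}(U_0) - \varepsilon/\tau + \cdots, \tag{7}$$

where $1/\tau \equiv (\partial\sigma_{\mathcal{R}}/\partial U)_{V,N}$ gives the temperature. The partial derivative is taken
at energy $U_0$. The higher order terms in the expansion vanish in the limit of
an infinitely large reservoir.*
Therefore $\Delta\sigma_{\mathcal{R}}$ defined by (4) becomes

$$\Delta\sigma_{\mathcal{R}} = -(\varepsilon_1 - \varepsilon_2)/\tau. \tag{8}$$

The final result of (5) and (8) is

$$\frac{P(\varepsilon_1)}{P(\varepsilon_2)} = \frac{\exp(-\varepsilon_1/\tau)}{\exp(-\varepsilon_2/\tau)}. \tag{9}$$

A term of the form $\exp(-\varepsilon/\tau)$ is known as a Boltzmann factor. This result is
of vast utility. It gives the ratio of the probability of finding the system in a
single quantum state 1 to the probability of finding the system in a single
quantum state 2.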
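
The step from the multiplicity ratio (2) to the Boltzmann ratio (9) can be checked numerically for an explicit reservoir. The sketch below is an illustration under assumptions made here, not part of the derivation: the reservoir is taken to be $N$ harmonic oscillators sharing $n$ quanta, with multiplicity $(N + n - 1)!/[n!\,(N-1)!]$, energies are measured in units of one quantum, and the temperature is obtained from a finite difference of the reservoir entropy at $U_0$.

```python
from math import lgamma, exp

def sigma_reservoir(N, n):
    """Entropy log g of N oscillators sharing n quanta (assumed reservoir model)."""
    return lgamma(N + n) - lgamma(n + 1) - lgamma(N)

# Assumed reservoir size and total energy, in units of one quantum.
N, n0 = 10**6, 2 * 10**6
eps1, eps2 = 1, 3          # energies of system states 1 and 2, in quanta

# Exact log of the ratio of reservoir multiplicities, as in (2) and (3).
exact_log_ratio = sigma_reservoir(N, n0 - eps1) - sigma_reservoir(N, n0 - eps2)

# 1/tau from a central difference of the reservoir entropy at U0.
inv_tau = (sigma_reservoir(N, n0 + 1) - sigma_reservoir(N, n0 - 1)) / 2.0

# Log of the ratio of Boltzmann factors, as in (8) and (9).
boltzmann_log_ratio = -(eps1 - eps2) * inv_tau

print(f"exact     P1/P2 = {exp(exact_log_ratio):.6f}")
print(f"Boltzmann P1/P2 = {exp(boltzmann_log_ratio):.6f}")
print(f"difference of logs = {exact_log_ratio - boltzmann_log_ratio:.2e}")
```

For this large reservoir the two ratios agree to about one part in a million; the residual difference is the contribution of the higher order terms in (7), which shrinks as the reservoir grows.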


Partition Function 


It is helpful to consider the function

$$Z(\tau) = \sum_s \exp(-\varepsilon_s/\tau), \tag{10}$$

called the partition function. The summation is over the Boltzmann factor
$\exp(-\varepsilon_s/\tau)$ for all states $s$ of the system. The partition function is the
proportionality factor between the probability $P(\varepsilon_s)$ and the Boltzmann factor
$\exp(-\varepsilon_s/\tau)$:

$$P(\varepsilon_s) = \frac{\exp(-\varepsilon_s/\tau)}{Z}. \tag{11}$$


We see that $\sum P(\varepsilon_s) = Z/Z = 1$: the sum of all probabilities is unity.
The result (11) is one of the most useful results of statistical physics. The
average energy of the system is $U = \langle\varepsilon\rangle = \sum \varepsilon_s P(\varepsilon_s)$, or

$$U = \frac{\sum \varepsilon_s \exp(-\varepsilon_s/\tau)}{Z} = \tau^2(\partial \log Z/\partial\tau). \tag{12}$$

* We expand $\sigma(U_0 - \varepsilon)$ and not $g(U_0 - \varepsilon)$ because the expansion of the latter quantity immediately
gives convergence difficulties.

Figure 3.4 Energy and heat capacity of a two state system as functions of the
temperature $\tau$. The energy is plotted in units of $\varepsilon$. (Horizontal axis: $\tau/\varepsilon$;
vertical axis: $U/\varepsilon$.)


The average energy refers to those states of a system that can exchange energy
with a reservoir. The notation $\langle\cdots\rangle$ denotes such an average value and is
called the thermal average or ensemble average. In (12) the symbol $U$ is used
for $\langle\varepsilon\rangle$ in conformity with common practice; $U$ will now refer to the system
and not, as earlier, to the system + reservoir.
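
Relations (10) to (12) translate directly into a few lines of code. The sketch below uses a hypothetical list of state energies and a hypothetical temperature, both chosen here only for illustration, and compares the direct average $\sum \varepsilon_s P(\varepsilon_s)$ with a numerical evaluation of $\tau^2\,\partial \log Z/\partial\tau$.

```python
from math import exp, log

def partition_function(energies, tau):
    """Z(tau): sum of Boltzmann factors over all states, as in (10)."""
    return sum(exp(-e / tau) for e in energies)

def probabilities(energies, tau):
    """P(eps_s) = exp(-eps_s / tau) / Z, as in (11)."""
    Z = partition_function(energies, tau)
    return [exp(-e / tau) / Z for e in energies]

def average_energy(energies, tau):
    """U = sum of eps_s * P(eps_s), the first form of (12)."""
    return sum(e * p for e, p in zip(energies, probabilities(energies, tau)))

def average_energy_from_log_z(energies, tau, h=1e-6):
    """U = tau^2 * d(log Z)/d(tau), using a central finite difference."""
    d_log_z = (log(partition_function(energies, tau + h))
               - log(partition_function(energies, tau - h))) / (2.0 * h)
    return tau**2 * d_log_z

# Hypothetical spectrum (one entry per state; equal entries are degenerate states).
levels = [0.0, 1.0, 1.0, 2.5]
tau = 0.8

probs = probabilities(levels, tau)
U_direct = average_energy(levels, tau)
U_log_z = average_energy_from_log_z(levels, tau)
print(f"sum of probabilities = {sum(probs):.12f}")
print(f"U (direct) = {U_direct:.8f}, U (from log Z) = {U_log_z:.8f}")
```

The sum runs over states, not over energy levels, so a degenerate level appears once for each state that shares its energy.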


Example: Energy and heat capacity of a two state system. We treat a system of one
particle with two states, one of energy 0 and one of energy $\varepsilon$. The particle is in thermal contact
with a reservoir at temperature $\tau$. We want to find the energy and the heat capacity of the
system as a function of the temperature $\tau$. The partition function for the two states of
the particle is

$$Z = \exp(-0/\tau) + \exp(-\varepsilon/\tau) = 1 + \exp(-\varepsilon/\tau). \tag{13}$$

The average energy is

$$U \equiv \langle\varepsilon\rangle = \frac{\varepsilon\exp(-\varepsilon/\tau)}{Z} = \varepsilon\,\frac{\exp(-\varepsilon/\tau)}{1 + \exp(-\varepsilon/\tau)}. \tag{14}$$

This function is plotted in Figure 3.4.
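If we shift the zero of energy and take the energies of the two states as $-\tfrac{1}{2}\varepsilon$ and $+\tfrac{1}{2}\varepsilon$,
instead of as 0 and $\varepsilon$, the results appear differently. We have

$$Z = \exp(\varepsilon/2\tau) + \exp(-\varepsilon/2\tau) = 2\cosh(\varepsilon/2\tau), \tag{15}$$

and

$$\langle\varepsilon\rangle = \frac{(-\tfrac{1}{2}\varepsilon)\exp(\varepsilon/2\tau) + (\tfrac{1}{2}\varepsilon)\exp(-\varepsilon/2\tau)}{Z} = -\varepsilon\,\frac{\sinh(\varepsilon/2\tau)}{2\cosh(\varepsilon/2\tau)} = -\tfrac{1}{2}\varepsilon\tanh(\varepsilon/2\tau). \tag{16}$$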


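The heat capacity $C_V$ of a system at constant volume is defined as

$$C_V \equiv \tau(\partial\sigma/\partial\tau)_V, \tag{17a}$$

which by the thermodynamic identity (34a) derived below is equivalent to the alternate
definition

$$C_V \equiv (\partial U/\partial\tau)_V. \tag{17b}$$

We hold $V$ constant because the values of the energy are calculated for a system at a specified
volume. From (14) and (17b),

$$C_V = \varepsilon\,\frac{\partial}{\partial\tau}\,\frac{1}{\exp(\varepsilon/\tau) + 1} = \left(\frac{\varepsilon}{\tau}\right)^2 \frac{\exp(\varepsilon/\tau)}{[\exp(\varepsilon/\tau) + 1]^2}. \tag{18a}$$

The same result follows from (16).
In conventional units $C_V$ is defined as $T(\partial S/\partial T)_V$ or $(\partial U/\partial T)_V$, whence

$$\text{(conventional)} \qquad C_V = k_B\left(\frac{\varepsilon}{k_B T}\right)^2 \frac{\exp(\varepsilon/k_B T)}{[\exp(\varepsilon/k_B T) + 1]^2}. \tag{18b}$$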


In fundamental units the heat capacity is dimensionless; in conventional units it has the 
dimensions of energy per kelvin. The specific heat is defined as the heat capacity per unit 
mass. 

The hump in the plot of heat capacity versus temperature in Figure 3.4 is called a Schottky
anomaly. For τ ≫ ε the heat capacity (18a) becomes

$$ C_V \simeq (\epsilon/2\tau)^2 . \tag{19} $$

Notice that $C_V \propto \tau^{-2}$ in this high temperature limit. In the low temperature limit the
temperature is small in comparison with the energy level spacing ε. For τ ≪ ε we have

$$ C_V \simeq (\epsilon/\tau)^2 \exp(-\epsilon/\tau) . \tag{20} $$

The exponential factor exp(−ε/τ) reduces $C_V$ rapidly as τ decreases, because exp(−1/x) → 0
as x → 0.
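The limits (19) and (20) and the position of the Schottky hump can be checked numerically. The following is a minimal sketch, not part of the original text: it evaluates (18a) in fundamental units as a function of x = τ/ε. The function names, the grid, and the sample temperatures are illustrative assumptions.

```python
import math

def heat_capacity(x):
    """C_V from (18a) in fundamental units; x = tau/epsilon (assumed > 0)."""
    y = 1.0 / x                      # y = epsilon/tau
    e = math.exp(-y)                 # exp(-y) form avoids overflow at small tau
    return y * y * e / (1.0 + e) ** 2

def high_limit(x):
    """High temperature form (19): (epsilon/2tau)^2."""
    return (1.0 / (2.0 * x)) ** 2

def low_limit(x):
    """Low temperature form (20): (epsilon/tau)^2 exp(-epsilon/tau)."""
    y = 1.0 / x
    return y * y * math.exp(-y)

# Locate the hump on an illustrative grid 0.05 <= tau/epsilon <= 3.
grid = [0.05 + 0.001 * i for i in range(2951)]
x_peak = max(grid, key=heat_capacity)
c_peak = heat_capacity(x_peak)

print("peak at tau/epsilon =", round(x_peak, 3), " C_V =", round(c_peak, 3))
print("tau/epsilon = 10  :", heat_capacity(10.0), "vs (19):", high_limit(10.0))
print("tau/epsilon = 0.05:", heat_capacity(0.05), "vs (20):", low_limit(0.05))
```

On this grid the maximum falls a little above τ/ε = 0.4, which is where the hump sits on the scale of Figure 3.4.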
Definition: Reversible process. A process is reversible if carried out in such a
way that the system is always infinitesimally close to the equilibrium condition.
For example, if the entropy is a function of the volume, any change of volume
must be carried out so slowly that the entropy at any volume V is closely equal
to the equilibrium entropy σ(V). Thus, the entropy is well defined at every
stage of a reversible process, and by reversing the direction of the change the 
system will be returned to its initial condition. In reversible processes, the 
condition of the system is well defined at all times, in contrast to irreversible 
processes, where usually we will not know what is going on during the process. 
We cannot apply the mathematical methods of thermal physics to systems 
whose condition is undefined. 

A volume change that leaves the system in the same quantum state is an 
example of an isentropic reversible process. If the system always remains in the 
same state the entropy change will be zero between any two stages of the pro- 
cess, because the number of states in an ensemble (p. 31) of similar systems does 
not change. Any process in which the entropy change vanishes is an isentropic 
reversible process. But reversible processes are not limited to isentropic pro- 
cesses, and we shall have a special interest also in isothermal reversible processes. 




PRESSURE 


Consider a system in the quantum state s of energy $\epsilon_s$. We assume $\epsilon_s$ to be a
function of the volume of the system. The volume is decreased slowly from V
to V − ΔV by application of an external force. Let the volume change take
place sufficiently slowly that the system remains in the same quantum state s
throughout the compression. The "same" state may be characterized by its
quantum numbers (Figure 3.5) or by the number of zeros in the wavefunction.
The energy of the state s after the reversible volume change is

$$ \epsilon_s(V - \Delta V) = \epsilon_s(V) - (d\epsilon_s/dV)\,\Delta V + \cdots . \tag{21} $$

Consider a pressure $p_s$ applied normal to all faces of a cube. The mechanical
work done on the system by the pressure in a contraction (Figure 3.6) of the
cube volume from V to V − ΔV appears as the change of energy of the system:

$$ U(V - \Delta V) - U(V) = \Delta U = -(d\epsilon_s/dV)\,\Delta V . \tag{22} $$

Figure 3.5 Dependence of energy on volume, for the energy levels of a free
particle confined to a cube. The curves are labeled by $n^2 = n_x^2 + n_y^2 + n_z^2$,
as in Figure 1.2. The multiplicities g are also given. The volume change here
is isotropic: a cube remains a cube. The energy range δε of the states
represented in an ensemble of systems will increase in a reversible
compression, but we know from the discussion in Chapter 2 that the width
of the energy range itself is of no practical importance. It is the change in
the average energy that is important.

Figure 3.6 Volume change −ΔV in uniform
compression of a cube.




Chapter 3: Boltzmann Distribution and Helmholtz Free Energy 


Here U denotes the energy of the system. Let A be the area of one face of the
cube; then

$$ A(\Delta x + \Delta y + \Delta z) = \Delta V , \tag{23} $$

if all increments ΔV and Δx = Δy = Δz are taken as positive in the compression.
The work done in the compression is

$$ \Delta U = p_s A(\Delta x + \Delta y + \Delta z) = p_s\,\Delta V , \tag{24} $$

so that, on comparison with (22),

$$ p_s = -d\epsilon_s/dV \tag{25} $$

is the pressure on a system in the state s.
We average (25) over all states of the ensemble to obtain the average pressure
⟨p⟩, usually written as p:

$$ p = -\left(\frac{\partial U}{\partial V}\right)_\sigma , \tag{26} $$

where U ≡ ⟨ε⟩. The entropy σ is held constant in the derivative because the
number of states in the ensemble is unchanged in the reversible compression
we have described. We have a collection of systems, each in some state, and
each remains in this state in the compression.

The result (26) corresponds to our mechanical picture of the pressure on a 
system that is maintained in some specific state. Appendix D discusses the 
result more deeply. For applications we shall need also the later result (50) for 
the pressure on a system maintained at constant temperature. 

We look for other expressions for the pressure. The number of states and thus
the entropy depend only on U and on V, for a fixed number of particles, so
that only the two variables U and V describe the system. The differential of
the entropy is

$$ d\sigma(U,V) = \left(\frac{\partial\sigma}{\partial U}\right)_V dU + \left(\frac{\partial\sigma}{\partial V}\right)_U dV . \tag{27} $$

This gives the differential change of the entropy for arbitrary independent
differential changes dU and dV. Assume now that we select dU and dV
interdependently, in such a way that the two terms on the right-hand side of (27)
cancel. The overall entropy change dσ will be zero. If we denote these
interdependent values of dU and dV by $(\delta U)_\sigma$ and $(\delta V)_\sigma$, the entropy change will
be zero:

$$ 0 = \left(\frac{\partial\sigma}{\partial U}\right)_V (\delta U)_\sigma + \left(\frac{\partial\sigma}{\partial V}\right)_U (\delta V)_\sigma . \tag{28} $$

After division by $(\delta V)_\sigma$,

$$ 0 = \left(\frac{\partial\sigma}{\partial U}\right)_V \frac{(\delta U)_\sigma}{(\delta V)_\sigma} + \left(\frac{\partial\sigma}{\partial V}\right)_U . \tag{29} $$

But the ratio $(\delta U)_\sigma/(\delta V)_\sigma$ is the partial derivative of U with respect to V at
constant σ:

$$ (\delta U)_\sigma/(\delta V)_\sigma \equiv (\partial U/\partial V)_\sigma . \tag{30} $$

With this and the definition $1/\tau \equiv (\partial\sigma/\partial U)_V$, Eq. (29) becomes

$$ \left(\frac{\partial U}{\partial V}\right)_\sigma = -\tau\left(\frac{\partial\sigma}{\partial V}\right)_U . \tag{31} $$

By (26) the left-hand side of (31) is equal to −p, whence

$$ p = \tau\left(\frac{\partial\sigma}{\partial V}\right)_U . \tag{32} $$

Thermodynamic Identity
Consider again the differential (27) of the entropy; substitute the new result for
the pressure and the definition of τ to obtain

$$ d\sigma = \frac{1}{\tau}\,dU + \frac{p}{\tau}\,dV , \tag{33} $$

or

$$ \boxed{\;\tau\,d\sigma = dU + p\,dV .\;} \tag{34a} $$

This useful relation will be called the thermodynamic identity. The form with N
variable will appear in (5.38). A simple transposition gives

$$ dU = \tau\,d\sigma - p\,dV , \qquad\text{or}\qquad dU = T\,dS - p\,dV . \tag{34b} $$

If the actual process of change of state of the system is reversible, we can
identify τdσ as the heat added to the system and −pdV as the work done on
the system. The increase of energy is caused in part by mechanical work and
in part by the transfer of heat. Heat is defined as the transfer of energy between
two systems brought into thermal contact (Chapter 8).


HELMHOLTZ FREE ENERGY 


The function

$$ F \equiv U - \tau\sigma \tag{35} $$

is called the Helmholtz free energy. This function plays the part in thermal
physics at constant temperature that the energy U plays in ordinary mechanical
processes, which are always understood to be at constant entropy, because no
internal changes of state are allowed. The free energy tells us how to balance
the conflicting demands of a system for minimum energy and maximum entropy.
The Helmholtz free energy will be a minimum for a system $\mathcal{S}$ in thermal
contact with a reservoir $\mathcal{R}$, if the volume of the system is constant.

We first show that F is an extremum in equilibrium at constant τ and V.
By definition, for infinitesimal reversible transfer from $\mathcal{R}$ to $\mathcal{S}$,

$$ dF_{\mathcal{S}} = dU_{\mathcal{S}} - \tau\,d\sigma_{\mathcal{S}} \tag{36} $$

at constant temperature. But $1/\tau \equiv (\partial\sigma_{\mathcal{S}}/\partial U_{\mathcal{S}})_V$, so that $dU_{\mathcal{S}} = \tau\,d\sigma_{\mathcal{S}}$ at
constant volume. Therefore (36) becomes

$$ dF_{\mathcal{S}} = 0 , \tag{37} $$

which is the condition for F to be an extremum with respect to all variations
at constant volume and temperature. We like F because we can calculate it from
the energy eigenvalues $\epsilon_s$ of the system (see p. 72).




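Comment. We can show that the extremum is a minimum. The total energy is
$U = U_{\mathcal{R}} + U_{\mathcal{S}}$. Then the total entropy is

$$ \sigma = \sigma_{\mathcal{R}} + \sigma_{\mathcal{S}} = \sigma_{\mathcal{R}}(U - U_{\mathcal{S}}) + \sigma_{\mathcal{S}}(U_{\mathcal{S}}) \simeq \sigma_{\mathcal{R}}(U) - U_{\mathcal{S}}(\partial\sigma_{\mathcal{R}}/\partial U_{\mathcal{R}})_{V,N} + \sigma_{\mathcal{S}}(U_{\mathcal{S}}) . \tag{38} $$

We know that

$$ (\partial\sigma_{\mathcal{R}}/\partial U_{\mathcal{R}})_{V,N} \equiv 1/\tau , \tag{39} $$

so that (38) becomes

$$ \sigma = \sigma_{\mathcal{R}}(U) - F_{\mathcal{S}}/\tau , \tag{40} $$

where $F_{\mathcal{S}} = U_{\mathcal{S}} - \tau\sigma_{\mathcal{S}}$ is the free energy of the system. Now $\sigma_{\mathcal{R}}(U)$ is constant; and we
recall that $\sigma = \sigma_{\mathcal{R}} + \sigma_{\mathcal{S}}$ in equilibrium is a maximum with respect to $U_{\mathcal{S}}$. It follows from
(40) that $F_{\mathcal{S}}$ must be a minimum with respect to $U_{\mathcal{S}}$ when the system is in the most probable
configuration. The free energy of the system at constant τ, V will increase for any departure
from the equilibrium configuration.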


Example: Minimum property of the free energy of a paramagnetic system. Consider the
model system of Chapter 1, with $N_\uparrow$ spins up and $N_\downarrow$ spins down. Let $N = N_\uparrow + N_\downarrow$;
the spin excess is $2s = N_\uparrow - N_\downarrow$. The entropy in the Stirling approximation is found
with the help of an approximate form of (1.31):

$$ \sigma(s) \simeq -\left(\tfrac{1}{2}N + s\right)\log\left(\tfrac{1}{2} + \frac{s}{N}\right) - \left(\tfrac{1}{2}N - s\right)\log\left(\tfrac{1}{2} - \frac{s}{N}\right) . \tag{41} $$

The energy in a magnetic field B is −2smB, where m is the magnetic moment of an elementary
magnet. The free energy function (to be called the Landau function in Chapter 10) is
$F_L(\tau,s,B) \equiv U(s,B) - \tau\sigma(s)$, or

$$ F_L(\tau,s,B) = -2smB + \left(\tfrac{1}{2}N + s\right)\tau\log\left(\tfrac{1}{2} + \frac{s}{N}\right) + \left(\tfrac{1}{2}N - s\right)\tau\log\left(\tfrac{1}{2} - \frac{s}{N}\right) . \tag{42} $$

At the minimum of $F_L(\tau,s,B)$ with respect to s, this function becomes equal to the equilibrium
free energy F(τ,B). That is, $F_L(\tau,\langle s\rangle,B) = F(\tau,B)$, because ⟨s⟩ is a function of τ and B. The
minimum of $F_L$ with respect to the spin excess occurs when

$$ (\partial F_L/\partial s)_{\tau,B} = 0 = -2mB + \tau\log\frac{N + 2s}{N - 2s} . \tag{43} $$

Thus in the magnetic field B the thermal equilibrium value of the spin excess 2s is given by

$$ \frac{N + \langle 2s\rangle}{N - \langle 2s\rangle} = \exp(2mB/\tau) ; \qquad \langle 2s\rangle = N\,\frac{\exp(2mB/\tau) - 1}{\exp(2mB/\tau) + 1} , \tag{44} $$

or, on dividing numerator and denominator by exp(mB/τ),

$$ \langle 2s\rangle = N\tanh(mB/\tau) . \tag{45} $$

The magnetization M is the magnetic moment per unit volume. If n is the number of spins
per unit volume, the magnetization in thermal equilibrium in the magnetic field is

$$ M = \langle 2s\rangle m/V = nm\tanh(mB/\tau) . \tag{46} $$

The free energy of the system in equilibrium can be obtained by substituting (45) in (42).
It is easier, however, to obtain F directly from the partition function for one magnet:

$$ Z = \exp(mB/\tau) + \exp(-mB/\tau) = 2\cosh(mB/\tau) . \tag{47} $$

Now use the relation F = −τ log Z as derived below. Multiply by N to obtain the result
for N magnets. (The magnetization is derived more simply by the method of Problem 2.)
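The minimum property can also be seen numerically. The sketch below is an illustration rather than part of the original derivation: it minimizes the Landau function (42) over s by a simple ternary search (valid because (42) is convex in s) and compares the result with (45) and with −Nτ log Z from (47). The values N = 1000, mB = 0.5, τ = 1 and all function names are arbitrary assumptions.

```python
import math

def landau_free_energy(s, N, mB, tau):
    """F_L(tau, s, B) from (42); mB is the product m*B in the same units as tau."""
    up = 0.5 * N + s
    down = 0.5 * N - s
    return (-2.0 * s * mB
            + up * tau * math.log(0.5 + s / N)
            + down * tau * math.log(0.5 - s / N))

def minimize_over_s(N, mB, tau, iterations=200):
    """Ternary search for the minimum of the convex function F_L on |s| < N/2."""
    lo = -0.5 * N * (1.0 - 1e-12)    # stay just inside the end points (log of 0)
    hi = 0.5 * N * (1.0 - 1e-12)
    for _ in range(iterations):
        a = lo + (hi - lo) / 3.0
        b = hi - (hi - lo) / 3.0
        if landau_free_energy(a, N, mB, tau) < landau_free_energy(b, N, mB, tau):
            hi = b
        else:
            lo = a
    return 0.5 * (lo + hi)

N, mB, tau = 1000.0, 0.5, 1.0        # illustrative values only
s_min = minimize_over_s(N, mB, tau)
F_min = landau_free_energy(s_min, N, mB, tau)

print("numerical  <2s> =", 2.0 * s_min)
print("eq. (45)   <2s> =", N * math.tanh(mB / tau))
print("F_L at minimum  =", F_min)
print("-N tau log Z    =", -N * tau * math.log(2.0 * math.cosh(mB / tau)))
```

The agreement of the last two printed numbers reflects the statement that the Landau function evaluated at its minimum equals the equilibrium free energy.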




Differential Relations 
The differential of F is

$$ dF = dU - \tau\,d\sigma - \sigma\,d\tau , $$

or, with use of the thermodynamic identity (34a),

$$ dF = -\sigma\,d\tau - p\,dV , \tag{48} $$

for which

$$ \boxed{\;\left(\frac{\partial F}{\partial\tau}\right)_V = -\sigma ; \qquad \left(\frac{\partial F}{\partial V}\right)_\tau = -p .\;} \tag{49} $$

These relations are widely used.
The free energy F in the result $p = -(\partial F/\partial V)_\tau$ acts as the effective energy
for an isothermal change of volume; contrast this result with (26). The result
may be written as

$$ p = -\left(\frac{\partial U}{\partial V}\right)_\tau + \tau\left(\frac{\partial\sigma}{\partial V}\right)_\tau , \tag{50} $$

by use of F ≡ U − τσ. The two terms on the right-hand side of (50) represent
what we may call the energy pressure and the entropy pressure. The energy
pressure $-(\partial U/\partial V)_\tau$ is dominant in most solids and the entropy pressure
$\tau(\partial\sigma/\partial V)_\tau$ is dominant in gases and in elastic polymers such as rubber (Problem
10). The entropy contribution is testimony of the importance of the entropy:
the naive feeling from simple mechanics that −dU/dV must tell everything
about the pressure is seriously incomplete for a process at constant temperature,
because the entropy can change in response to the volume change even if the
energy is independent of volume, as for an ideal gas at constant temperature.




Maxwell relation. We can now derive one of a group of useful thermodynamic relations
called Maxwell relations. Form the cross-derivatives $\partial^2 F/\partial V\,\partial\tau$ and $\partial^2 F/\partial\tau\,\partial V$, which must
be equal to each other. It follows from (49) that

$$ (\partial\sigma/\partial V)_\tau = (\partial p/\partial\tau)_V , \tag{51} $$

a relation that is not at all obvious. Other Maxwell relations will be derived later at
appropriate points, by similar arguments.
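As a quick consistency check, offered as a sketch that borrows the ideal gas results (72) and (75) obtained later in this chapter, both sides of the Maxwell relation (51) can be evaluated explicitly and are seen to agree:

```latex
\begin{align*}
\sigma &= N\log\!\left[(M\tau/2\pi\hbar^2)^{3/2}\,V\right] + \tfrac{3}{2}N - N\log N + N
  &&\Longrightarrow\quad (\partial\sigma/\partial V)_\tau = N/V , \\
p &= N\tau/V
  &&\Longrightarrow\quad (\partial p/\partial\tau)_V = N/V .
\end{align*}
```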




Calculation of F from Z 


Because F ≡ U − τσ and $\sigma = -(\partial F/\partial\tau)_V$, we have the differential equation

$$ F = U + \tau(\partial F/\partial\tau)_V , \qquad\text{or}\qquad -\tau^2\,\partial(F/\tau)/\partial\tau = U . \tag{52} $$

We show that this equation is satisfied by

$$ F/\tau = -\log Z , \tag{53} $$

where Z is the partition function. On substitution,

$$ \partial(F/\tau)/\partial\tau = -\partial\log Z/\partial\tau = -U/\tau^2 \tag{54} $$

by (12). This proves that

$$ \boxed{\;F = -\tau\log Z\;} \tag{55} $$

satisfies the required differential equation (52).

It would appear possible for F/τ to contain an additive constant α such that
F = −τ log Z + ατ. However, the entropy must reduce to $\log g_0$ when the
temperature is so low that only the $g_0$ coincident states at the lowest energy $\epsilon_0$
are occupied. In that limit $\log Z \to \log g_0 - \epsilon_0/\tau$, so that
$\sigma = -\partial F/\partial\tau \to \partial(\tau\log Z)/\partial\tau = \log g_0$ only if α = 0.

We may write the result as

$$ Z = \exp(-F/\tau) ; \tag{56} $$

and the Boltzmann factor (11) for the occupancy probability of a quantum
state s becomes

$$ P(\epsilon_s) = \frac{\exp(-\epsilon_s/\tau)}{Z} = \exp[(F - \epsilon_s)/\tau] . \tag{57} $$
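The chain from Z to F to σ and U can be exercised on the two state system of (13). The following sketch is illustrative only: it takes F = −τ log Z, obtains the entropy from a central finite difference for −∂F/∂τ, and checks that F + τσ reproduces the energy (14) and that the probabilities (57) sum to one. The function names, the step size h, and the choice ε = 1 are assumptions.

```python
import math

def free_energy(tau, eps=1.0):
    """F = -tau log Z with Z = 1 + exp(-eps/tau) from (13)."""
    return -tau * math.log(1.0 + math.exp(-eps / tau))

def entropy(tau, eps=1.0, h=1e-6):
    """sigma = -(dF/dtau), estimated by a central finite difference."""
    return -(free_energy(tau + h, eps) - free_energy(tau - h, eps)) / (2.0 * h)

def energy_from_F(tau, eps=1.0):
    """U = F + tau*sigma, i.e. the definition (35) solved for U."""
    return free_energy(tau, eps) + tau * entropy(tau, eps)

def energy_direct(tau, eps=1.0):
    """U from (14)."""
    b = math.exp(-eps / tau)
    return eps * b / (1.0 + b)

def occupancies(tau, eps=1.0):
    """P = exp[(F - eps_s)/tau] from (57) for the two states."""
    F = free_energy(tau, eps)
    return [math.exp((F - e) / tau) for e in (0.0, eps)]

for tau in (0.3, 1.0, 5.0):
    print(tau, energy_from_F(tau), energy_direct(tau), sum(occupancies(tau)))
```

At high temperature the entropy computed this way approaches log 2, the logarithm of the number of accessible states.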


IDEAL GAS: A FIRST LOOK 


One atom in a box. We calculate the partition function $Z_1$ of one atom of
mass M free to move in a cubical box of volume $V = L^3$. The orbitals of the
free particle wave equation $-(\hbar^2/2M)\nabla^2\psi = \epsilon\psi$ are

$$ \psi(x,y,z) = A\sin(n_x\pi x/L)\sin(n_y\pi y/L)\sin(n_z\pi z/L) , \tag{58} $$

where $n_x$, $n_y$, $n_z$ are any positive integers, as in Chapter 1. Negative integers do
not give independent orbitals, and a zero does not give a solution. The energy
values are

$$ \epsilon_n = \frac{\hbar^2}{2M}\left(\frac{\pi}{L}\right)^2 (n_x^2 + n_y^2 + n_z^2) . \tag{59} $$

We neglect the spin and all other structure of the atom, so that a state of the
system is entirely specified by the values of $n_x$, $n_y$, $n_z$.


The partition function is the sum over the states (59):

$$ Z_1 = \sum_{n_x}\sum_{n_y}\sum_{n_z} \exp[-\hbar^2\pi^2(n_x^2 + n_y^2 + n_z^2)/2ML^2\tau] . \tag{60} $$

Provided the spacing of adjacent energy values is small in comparison with τ,
we may replace the summations by integrations:

$$ Z_1 = \int_0^\infty dn_x \int_0^\infty dn_y \int_0^\infty dn_z\, \exp[-\alpha^2(n_x^2 + n_y^2 + n_z^2)] . \tag{61} $$

The notation $\alpha^2 \equiv \hbar^2\pi^2/2ML^2\tau$ is introduced for convenience. The exponential
may be written as the product of three factors

$$ \exp(-\alpha^2 n_x^2)\exp(-\alpha^2 n_y^2)\exp(-\alpha^2 n_z^2) , $$

so that

$$ Z_1 = \left(\int_0^\infty dn_x \exp(-\alpha^2 n_x^2)\right)^3 = (1/\alpha)^3\left(\int_0^\infty dx\,\exp(-x^2)\right)^3 = \pi^{3/2}/8\alpha^3 , $$

whence

$$ Z_1 = \frac{V}{(2\pi\hbar^2/M\tau)^{3/2}} = n_Q V = n_Q/n , \tag{62} $$

in terms of the concentration n = 1/V.
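How good is the replacement of the sums in (60) by the integrals in (61)? The sketch below looks at one of the three identical factors: it compares the sum of exp(−α²n²) over n = 1, 2, 3, ... with the integral √π/2α for a few values of α. Small α corresponds to level spacing small compared with τ. The function names, the truncation rule, and the sample values of α are illustrative assumptions.

```python
import math

def lattice_sum(alpha):
    """Sum of exp(-alpha^2 n^2) over n = 1, 2, 3, ... (one factor of (60))."""
    n_max = int(math.ceil(10.0 / alpha))   # later terms are smaller than exp(-100)
    return sum(math.exp(-(alpha * n) ** 2) for n in range(1, n_max + 1))

def gaussian_integral(alpha):
    """Integral of exp(-alpha^2 n^2) dn from 0 to infinity (one factor of (61))."""
    return math.sqrt(math.pi) / (2.0 * alpha)

def relative_error(alpha):
    """Fractional error made by replacing the sum with the integral."""
    exact = lattice_sum(alpha)
    return abs(gaussian_integral(alpha) - exact) / exact

for alpha in (0.1, 0.01, 0.001):
    print(alpha, lattice_sum(alpha), gaussian_integral(alpha), relative_error(alpha))
```

The fractional error shrinks roughly in proportion to α, which is the sense in which the integral becomes accurate when the spacing of adjacent energy values is small in comparison with τ.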
Here

$$ \boxed{\;n_Q \equiv (M\tau/2\pi\hbar^2)^{3/2}\;} \tag{63} $$

is called the quantum concentration. It is the concentration associated with one
atom in a cube of side equal to the thermal average de Broglie wavelength,
which is a length roughly equal to $\hbar/M\langle v\rangle \sim \hbar/(M\tau)^{1/2}$. Here ⟨v⟩ is a thermal
average velocity. This concentration will keep turning up in the thermal physics
of gases, in semiconductor theory, and in the theory of chemical reactions.
For helium at atmospheric pressure at room temperature, $n \approx 2.5 \times 10^{19}\,\mathrm{cm^{-3}}$
and $n_Q \approx 0.8 \times 10^{25}\,\mathrm{cm^{-3}}$. Thus, $n/n_Q \approx 3 \times 10^{-6}$, which is very
small compared to unity, so that helium is very dilute under normal conditions.
Whenever $n/n_Q \ll 1$ we say that the gas is in the classical regime. An ideal gas
is defined as a gas of noninteracting atoms in the classical regime.
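The helium estimate can be reproduced from (63) with τ = k_B T. The sketch below is a rough check, not a precise calculation: it assumes T = 300 K for "room temperature", a pressure of 101325 Pa for "atmospheric pressure", the ideal gas law for n, and standard SI values of the physical constants. All of these inputs, and the variable names, are assumptions supplied for illustration.

```python
import math

# Assumed inputs (SI units); not taken from the text.
HBAR = 1.054571817e-34        # J s
K_B = 1.380649e-23            # J/K
AMU = 1.66053906660e-27       # kg
M_HELIUM = 4.002602 * AMU     # mass of a helium-4 atom, kg
T_ROOM = 300.0                # K, assumed "room temperature"
P_ATM = 101325.0              # Pa, assumed "atmospheric pressure"

def quantum_concentration(mass, temperature):
    """n_Q = (M tau / 2 pi hbar^2)^(3/2) from (63), with tau = k_B T; result in m^-3."""
    tau = K_B * temperature
    return (mass * tau / (2.0 * math.pi * HBAR ** 2)) ** 1.5

n_Q_cm3 = quantum_concentration(M_HELIUM, T_ROOM) * 1e-6   # convert m^-3 to cm^-3
n_cm3 = P_ATM / (K_B * T_ROOM) * 1e-6                      # ideal gas law, cm^-3
ratio = n_cm3 / n_Q_cm3

print("n_Q   =", n_Q_cm3, "cm^-3")
print("n     =", n_cm3, "cm^-3")
print("n/n_Q =", ratio)
```

With these inputs the ratio comes out at a few parts in a million, in line with the order of magnitude quoted above.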

The thermal average energy of the atom in the box is, as in (12),

$$ U = \frac{\sum_n \epsilon_n \exp(-\epsilon_n/\tau)}{Z_1} = \tau^2(\partial\log Z_1/\partial\tau) , \tag{64} $$

because $Z_1^{-1}\exp(-\epsilon_n/\tau)$ is the probability the system is in the state n. From (62),

$$ \log Z_1 = -\tfrac{3}{2}\log(1/\tau) + \text{terms independent of } \tau , $$

so that for an ideal gas of one atom

$$ U = \tfrac{3}{2}\tau . \tag{65} $$

If $\tau = k_B T$, where $k_B$ is the Boltzmann constant, then $U = \tfrac{3}{2}k_B T$, the
well-known result for the energy per atom of an ideal gas.

The thermal average occupancy of a free particle orbital satisfies the inequality

$$ Z_1^{-1}\exp(-\epsilon_n/\tau) < Z_1^{-1} = n/n_Q , $$

which sets an upper limit of $4 \times 10^{-6}$ for the occupancy of an orbital by a
helium atom at standard concentration and temperature. For the classical
regime to apply, this occupancy must be ≪ 1. We note that $\epsilon_n$ as defined by (59)
is always positive for a free atom.


Example: N atoms in a box. There follows now a tricky argument that we will use
temporarily until we develop in Chapter 6 a powerful method to deal with the problem of
many noninteracting identical atoms in a box. We first treat an ideal gas of N atoms in a
box, all atoms of different species or different isotopes. This is a simple extension of the
one atom result. We then discuss the major correction factor that arises when all atoms are
identical, of the same isotope of the same species.

If the orbital indices are all different, each entry will occur N! times in $Z_1^N$, whereas the
entry should occur only once if the particles are identical. Thus, $Z_1^N$ overcounts the states by
a factor of N!, and the correct partition function for N identical particles is

$$ Z_N = \frac{1}{N!}\,Z_1^N = \frac{1}{N!}\,(n_Q V)^N \tag{68} $$

in the classical regime. Here $n_Q = (M\tau/2\pi\hbar^2)^{3/2}$ from (63).

There is a step in the argument where we assume that all N occupied orbitals are always
different orbitals. It is no simple matter to evaluate directly the error introduced by this
approximation, but later we will confirm by another method the validity of (68) in the
classical regime $n \ll n_Q$. The N! factor changes the result for the entropy of the ideal gas.
The entropy is an experimentally measurable quantity, and it has been confirmed that the
N! factor is correct in this low concentration limit.


Energy. The energy of the ideal gas follows from the N particle partition
function by use of (12):

$$ U = \tau^2(\partial\log Z_N/\partial\tau) = \tfrac{3}{2}N\tau , \tag{69} $$

consistent with (65) for one particle. The free energy is

$$ F = -\tau\log Z_N = -\tau\log Z_1^N + \tau\log N! . \tag{70} $$

With the earlier result $Z_1 = n_Q V = (M\tau/2\pi\hbar^2)^{3/2}\,V$ and the Stirling approximation
$\log N! \simeq N\log N - N$, we have

$$ F = -\tau N\log[(M\tau/2\pi\hbar^2)^{3/2}\,V] + \tau N\log N - \tau N . \tag{71} $$

From the free energy we can calculate the entropy and the pressure of the ideal
gas of N atoms. The pressure follows from (49):

$$ p = -(\partial F/\partial V)_\tau = N\tau/V , \tag{72} $$

or

$$ \boxed{\;pV = N\tau ,\;} \tag{73} $$

which is called the ideal gas law. In conventional units,

$$ pV = N k_B T . \tag{74} $$

The entropy follows from (49):

$$ \sigma = -(\partial F/\partial\tau)_V = N\log[(M\tau/2\pi\hbar^2)^{3/2}\,V] + \tfrac{3}{2}N - N\log N + N , \tag{75} $$

or

$$ \boxed{\;\sigma = N[\log(n_Q/n) + \tfrac{5}{2}] ,\;} \tag{76} $$

with the concentration n = N/V. This result is known as the Sackur-Tetrode
equation for the entropy of a monatomic ideal gas. It agrees with experiment.
The result involves ħ through the term $n_Q$, so even for the classical ideal gas
the entropy involves a quantum concept. We shall derive these results again in
Chapter 6 by a direct method that does not explicitly involve the N! or identical
particle argument. The energy (69) also follows from U = F + τσ; with use of
(71) and (76) we have $U = \tfrac{3}{2}N\tau$.
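The relations among (71), (72), (76), and (69) can be checked numerically by differentiating the free energy. The sketch below is illustrative: it works in units chosen so that M/2πħ² = 1 (an assumption made only to keep the numbers simple, so that n_Q = τ^(3/2)), and it uses central finite differences for the derivatives in (49). The function names, step sizes, and sample values of τ, V, N are arbitrary.

```python
import math

def free_energy(tau, V, N):
    """F from (71), in assumed units where M / (2 pi hbar^2) = 1, so n_Q = tau**1.5."""
    n_Q = tau ** 1.5
    return -tau * N * math.log(n_Q * V) + tau * N * math.log(N) - tau * N

def pressure(tau, V, N, h=1e-4):
    """p = -(dF/dV) at constant tau, by central finite difference."""
    return -(free_energy(tau, V + h, N) - free_energy(tau, V - h, N)) / (2.0 * h)

def entropy(tau, V, N, h=1e-6):
    """sigma = -(dF/dtau) at constant V, by central finite difference."""
    return -(free_energy(tau + h, V, N) - free_energy(tau - h, V, N)) / (2.0 * h)

def sackur_tetrode(tau, V, N):
    """sigma = N[log(n_Q/n) + 5/2] from (76), same units as free_energy."""
    return N * (math.log(tau ** 1.5 * V / N) + 2.5)

tau, V, N = 2.0, 1000.0, 100.0       # illustrative dilute-gas values
p = pressure(tau, V, N)
sigma = entropy(tau, V, N)
U = free_energy(tau, V, N) + tau * sigma

print("p V     =", p * V, " N tau      =", N * tau)
print("sigma   =", sigma, " eq. (76)   =", sackur_tetrode(tau, V, N))
print("F+tau*s =", U, " (3/2) N tau =", 1.5 * N * tau)
```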


Example: Equipartition of energy. The energy $U = \tfrac{3}{2}N\tau$ from (69) is ascribed to a
contribution ½τ from each "degree of freedom" of each particle, where the number of degrees of
freedom is the number of dimensions of the space in which the atoms move: 3 in this
example. In the classical form of statistical mechanics, the partition function contains the
kinetic energy of the particles in an integral over the momentum components $p_x$, $p_y$, $p_z$.
For one free particle

$$ Z_1 \propto \iiint \exp[-(p_x^2 + p_y^2 + p_z^2)/2M\tau]\; dp_x\,dp_y\,dp_z , \tag{77} $$

a result similar to (61). The limits of integration are ±∞ for each component. The thermal
average energy may be calculated by use of (12) and is equal to $\tfrac{3}{2}\tau$.

The result is generalized in the classical theory. Whenever the hamiltonian of the system
is homogeneous of degree 2 in a canonical momentum component, the classical limit of the
thermal average kinetic energy associated with that momentum will be ½τ. Further, if the
hamiltonian is homogeneous of degree 2 in a position coordinate component, the thermal
average potential energy associated with that coordinate will also be ½τ. The result thus
applies to the harmonic oscillator in the classical limit. The quantum results for the
harmonic oscillator and for the diatomic rotator are derived in Problems 3 and 6, respectively.
At high temperatures the classical limits are attained, as in Figure 3.9.

[Figure 3.9: regions of the curve labeled Translation and Rotation; horizontal axis:
temperature, K, from 10 to 5000.]

Figure 3.9 Heat capacity at constant volume of one molecule of H₂ in
the gas phase. The vertical scale is in fundamental units; to obtain a value
in conventional units, multiply by $k_B$. The contribution from the three
translational degrees of freedom is 3/2; the contribution at high temperatures
from the two rotational degrees of freedom is 1; and the contribution
from the potential and kinetic energy of the vibrational motion in the
high temperature limit is 1. The classical limits are attained when
τ ≫ relevant energy level separations.


Example: Entropy of mixing. In Chapter 1 we calculated the number of possible
arrangements of A and B in a solid made up of N − t atoms A and t atoms B. We found in (1.20)
for the number of arrangements:

$$ g(N,t) = \frac{N!}{(N - t)!\;t!} . \tag{78} $$

The entropy associated with these arrangements is

$$ \sigma(N,t) = \log g(N,t) = \log N! - \log(N - t)! - \log t! , \tag{79} $$

and is plotted in Figure 3.10 for N = 20. This contribution to the total entropy of an alloy
system is called the entropy of mixing. The result (79) may be put in a more convenient form
by use of the Stirling approximation:

$$ \begin{aligned}
\sigma(N,t) &\simeq N\log N - N - (N - t)\log(N - t) + N - t - t\log t + t \\
&= N\log N - (N - t)\log(N - t) - t\log t \\
&= -(N - t)\log(1 - t/N) - t\log(t/N) ,
\end{aligned} $$

or, with x ≡ t/N,

$$ \sigma(x) = -N[(1 - x)\log(1 - x) + x\log x] . \tag{80} $$

This result gives the entropy of mixing of an alloy $A_{1-x}B_x$ treated as a random
(homogeneous) solid solution. The problem is developed in detail in Chapter 11.

[Figure 3.10: vertical axis, mixing entropy; horizontal axis, alloy composition $A_{1-x}B_x$,
with x from 0 to 1.0.]

Figure 3.10 Mixing entropy of a random binary alloy as a function of
the proportions of the constituent atoms A and B. The curve plotted
was calculated for a total of 20 atoms. We see that this entropy is a
maximum when A and B are present in equal proportions (x = 0.5),
and the entropy is zero for pure A or pure B.
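For a system as small as N = 20 the Stirling form (80) is only approximate, and it is instructive to compare it with the exact count (79). The sketch below is an illustration: it evaluates both expressions for every composition t = 0, ..., N. The use of `math.lgamma` for log N! and the function names are implementation choices, not part of the text.

```python
import math

def exact_entropy(N, t):
    """sigma(N,t) = log N! - log (N-t)! - log t! from (79); lgamma(k+1) = log k!."""
    return math.lgamma(N + 1) - math.lgamma(N - t + 1) - math.lgamma(t + 1)

def stirling_entropy(N, t):
    """sigma(x) = -N[(1-x) log(1-x) + x log x] from (80), with x = t/N."""
    x = t / N
    total = 0.0
    for y in (x, 1.0 - x):
        if y > 0.0:                  # y log y -> 0 as y -> 0
            total += y * math.log(y)
    return -N * total

N = 20
table = [(t, exact_entropy(N, t), stirling_entropy(N, t)) for t in range(N + 1)]
t_max_exact = max(range(N + 1), key=lambda t: exact_entropy(N, t))
t_max_stirling = max(range(N + 1), key=lambda t: stirling_entropy(N, t))

for t, exact, approx in table:
    print(t / N, round(exact, 3), round(approx, 3))
```

Both forms vanish for pure A or pure B and peak at x = 0.5; the Stirling form lies somewhat above the exact value at intermediate compositions for N this small.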

We ask: Is the homogeneous solid solution the equilibrium condition of a mixture of A
and B atoms, or is the equilibrium a two-phase system, such as a mixture of crystallites of
pure A and crystallites of pure B? The complete answer is the basis of much of the science
of metallurgy: the answer will depend on the temperature and on the interatomic
interaction energies $U_{AA}$, $U_{BB}$, and $U_{AB}$. In the special case that the interaction energies between
AA, BB, and AB neighbor pairs are all equal, the homogeneous solid solution will have a
lower free energy than the corresponding mixture of crystallites of the pure elements. The
free energy of the solid solution $A_{1-x}B_x$ is

$$ F = F_0 - \tau\sigma(x) = F_0 + N\tau[(1 - x)\log(1 - x) + x\log x] , \tag{81} $$

which we must compare with

$$ F = (1 - x)F_0 + xF_0 = F_0 \tag{82} $$

for the mixture of A and B crystals in the proportion (1 − x) to x. The entropy of mixing
is always positive (all entropies are positive), so that the solid solution has the lower
free energy in this special case.

There is a tendency for at least a very small proportion of any element B to dissolve in
any other element A, even if a strong repulsive energy exists between a B atom and the
surrounding A atoms. Let this repulsive energy be denoted by U, a positive quantity. If a
very small proportion x ≪ 1 of B atoms is present, the total repulsive energy is xNU, where
xN is the number of B atoms. The mixing entropy (80) is approximately

$$ \sigma \simeq -xN\log x \tag{83} $$

in this limit, so that the free energy is

$$ F(x) = N(xU + \tau x\log x) , \tag{84} $$

which has a minimum when

$$ \partial F/\partial x = N(U + \tau\log x + \tau) = 0 , \tag{85} $$

or

$$ x = \exp(-1)\exp(-U/\tau) . \tag{86} $$

This shows there is a natural impurity content in all crystals.
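The condition (85) can be solved numerically as a check on (86). The sketch below is illustrative only: it finds the root of U + τ log x + τ by bisection on 0 < x < 1 and compares it with exp(−1) exp(−U/τ). The values U = 0.5 and τ = 0.1 (in the same arbitrary energy units) and the function names are assumptions.

```python
import math

def dF_dx_per_atom(x, U, tau):
    """(1/N) dF/dx from (85): U + tau log x + tau."""
    return U + tau * math.log(x) + tau

def equilibrium_fraction(U, tau, iterations=200):
    """Root of (85) on (0, 1) by bisection; the derivative increases with x."""
    lo, hi = 1e-300, 1.0             # derivative is negative at lo, positive at hi
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if dF_dx_per_atom(mid, U, tau) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

U, tau = 0.5, 0.1                    # illustrative values, U/tau = 5
x_numeric = equilibrium_fraction(U, tau)
x_formula = math.exp(-1.0) * math.exp(-U / tau)

print("bisection :", x_numeric)
print("eq. (86)  :", x_formula)
```

Because the equilibrium fraction depends exponentially on U/τ, even a strongly repulsive impurity retains a small but nonzero solubility at any finite temperature.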


SUMMARY 
1. The factor

$$ P(\epsilon_s) = \exp(-\epsilon_s/\tau)/Z $$

is the probability of finding a system in a state s of energy $\epsilon_s$ when the system
is in thermal contact with a large reservoir at temperature τ. The number of
particles in the system is assumed constant.

2. The partition function is

$$ Z \equiv \sum_s \exp(-\epsilon_s/\tau) . $$

3. The pressure is given by

$$ p = -(\partial U/\partial V)_\sigma = \tau(\partial\sigma/\partial V)_U . $$

4. The Helmholtz free energy is defined as F ≡ U − τσ. It is a minimum in
equilibrium for a system held at constant τ, V.

5. $\sigma = -(\partial F/\partial\tau)_V$; $p = -(\partial F/\partial V)_\tau$.

6. F = −τ log Z. This result is very useful in calculations of F and of quantities
such as p and σ derived from F.

7. For an ideal monatomic gas of N atoms of spin zero,

$$ Z_N = (n_Q V)^N/N! , $$

if $n = N/V \ll n_Q$. The quantum concentration $n_Q \equiv (M\tau/2\pi\hbar^2)^{3/2}$. Further,

$$ pV = N\tau ; \qquad \sigma = N[\log(n_Q/n) + \tfrac{5}{2}] ; \qquad C_V = \tfrac{3}{2}N . $$

8. A process is reversible if the system remains infinitesimally close to the
equilibrium state at all times during the process.


PROBLEMS 


1. Free energy of a two state system. (a) Find an expression for the free
energy as a function of τ of a system with two states, one at energy 0 and one
at energy ε. (b) From the free energy, find expressions for the energy and entropy
of the system. The entropy is plotted in Figure 3.11.


2. Magnetic susceptibility. (a) Use the partition function to find an exact
expression for the magnetization M and the susceptibility χ ≡ dM/dB as a
function of temperature and magnetic field for the model system of magnetic
moments in a magnetic field. The result for the magnetization is
M = nm tanh(mB/τ), as derived in (46) by another method. Here n is the particle
concentration. The result is plotted in Figure 3.12. (b) Find the free energy and
express the result as a function only of τ and the parameter x ≡ M/nm. (c) Show
that the susceptibility is χ = nm²/τ in the limit mB ≪ τ.

Figure 3.11 Entropy of a two-state system as a function
of τ/ε. Notice that σ(τ) → log 2 as τ → ∞.

Figure 3.12 Plot of the total magnetic moment as a function
of mB/τ. Notice that at low mB/τ the moment is a linear function
of mB/τ, but at high mB/τ the moment tends to saturate.


3. Free energy of a harmonic oscillator. A one-dimensional harmonic oscillator
has an infinite series of equally spaced energy states, with $\epsilon_s = s\hbar\omega$, where
s is a positive integer or zero, and ω is the classical frequency of the oscillator.
We have chosen the zero of energy at the state s = 0. (a) Show that for a
harmonic oscillator the free energy is

$$ F = \tau\log[1 - \exp(-\hbar\omega/\tau)] . \tag{87} $$

Note that at high temperatures such that τ ≫ ħω we may expand the argument
of the logarithm to obtain F ≃ τ log(ħω/τ). (b) From (87) show that the entropy
is

$$ \sigma = \frac{\hbar\omega/\tau}{\exp(\hbar\omega/\tau) - 1} - \log[1 - \exp(-\hbar\omega/\tau)] . \tag{88} $$

The entropy is shown in Figure 3.13 and the heat capacity in Figure 3.14.

Figure 3.13 Entropy versus temperature for harmonic
oscillator of frequency ω.


4. Energy fluctuations. Consider a system of fixed volume in thermal contact
with a reservoir. Show that the mean square fluctuation in the energy of the
system is

$$ \langle(\epsilon - \langle\epsilon\rangle)^2\rangle = \tau^2(\partial U/\partial\tau)_V . \tag{89} $$

Here U is the conventional symbol for ⟨ε⟩. Hint: Use the partition function Z
to relate ∂U/∂τ to the mean square fluctuation. Also, multiply out the term
(···)². Note: The temperature τ of a system is a quantity that by definition does
not fluctuate in value when the system is in thermal contact with a reservoir.
Any other attitude would be inconsistent with our definition of the temperature
of a system. The energy of such a system may fluctuate, but the temperature
does not. Some workers do not adhere to a rigorous definition of temperature.
Thus Landau and Lifshitz give the result

$$ \langle(\Delta\tau)^2\rangle = \tau^2/C_V , \tag{90} $$

but this should be viewed as just another form of (89) with Δτ set equal to
$\Delta U/C_V$. We know that $\Delta U = C_V\,\Delta\tau$, whence (90) becomes $\langle(\Delta U)^2\rangle = \tau^2 C_V$,
which is our result (89).

Figure 3.14 Heat capacity versus temperature
for harmonic oscillator of frequency ω. The
horizontal scale is in units of τ/ħω, which is
identical with $T/\theta_E$, where $\theta_E$ is called the
Einstein temperature. In the high temperature
limit $C_V \to k_B$, or 1 in fundamental units. This
value is known as the classical value. At low
temperatures $C_V$ decreases exponentially.


5. Overhauser effect. Suppose that by a suitable external mechanical or
electrical arrangement one can add αε to the energy of the heat reservoir
whenever the reservoir passes to the system the quantum of energy ε. The net
increase of energy of the reservoir is (α − 1)ε. Here α is some numerical factor,
positive or negative. Show that the effective Boltzmann factor for this abnormal
system is given by

$$ P(\epsilon) \propto \exp[-(1 - \alpha)\epsilon/\tau] . \tag{91} $$

This reasoning gives the statistical basis of the Overhauser effect whereby the
nuclear polarization in a magnetic field can be enhanced above the thermal
equilibrium polarization. Such a condition requires the active supply of energy
to the system from an external source. The system is not in equilibrium, but is
said to be in a steady state. Cf. A. W. Overhauser, Phys. Rev. 92, 411 (1953).


6. Rotation of diatomic molecules. In our first look at the ideal gas we
considered only the translational energy of the particles. But molecules can rotate,
with kinetic energy. The rotational motion is quantized; and the energy levels
of a diatomic molecule are of the form

$$ \epsilon(j) = j(j + 1)\epsilon_0 , \tag{92} $$

where j is any positive integer including zero: j = 0, 1, 2, .... The multiplicity
of each rotational level is g(j) = 2j + 1. (a) Find the partition function Z(τ)
for the rotational states of one molecule. Remember that Z is a sum over all
states, not over all levels; this makes a difference. (b) Evaluate Z(τ) approximately
for $\tau \gg \epsilon_0$, by converting the sum to an integral. (c) Do the same for
$\tau \ll \epsilon_0$, by truncating the sum after the second term. (d) Give expressions for
the energy U and the heat capacity C, as functions of τ, in both limits. Observe
that the rotational contribution to the heat capacity of a diatomic molecule
approaches 1 (or, in conventional units, $k_B$) when $\tau \gg \epsilon_0$. (e) Sketch the behavior
of U(τ) and C(τ), showing the limiting behaviors for τ → ∞ and τ → 0.


7. Zipper problem. A zipper has N links; each link has a state in which it is
closed with energy 0 and a state in which it is open with energy ε. We require,
however, that the zipper can only unzip from the left end, and that the link
number s can only open if all links to the left (1, 2, ..., s − 1) are already open.
(a) Show that the partition function can be summed in the form

$$ Z = \frac{1 - \exp[-(N + 1)\epsilon/\tau]}{1 - \exp(-\epsilon/\tau)} . \tag{93} $$

(b) In the limit ε ≫ τ, find the average number of open links. The model is a
very simplified model of the unwinding of two-stranded DNA molecules; see
C. Kittel, Amer. J. Physics 37, 917 (1969).


8. Quantum concentration. Consider one particle confined to a cube of side
L; the concentration in effect is $n = 1/L^3$. Find the kinetic energy of the particle
when in the ground orbital. There will be a value of the concentration for which
this zero-point quantum kinetic energy is equal to the temperature τ. (At this
concentration the occupancy of the lowest orbital is of the order of unity; the
lowest orbital always has a higher occupancy than any other orbital.) Show that
the concentration $n_0$ thus defined is equal to the quantum concentration $n_Q$
defined by (63), within a factor of the order of unity.


9. Partition function for two systems. Show that the partition function
$Z(1+2)$ of two independent systems 1 and 2 in thermal contact at a common
temperature $\tau$ is equal to the product of the partition functions of the
separate systems:

$$ Z(1+2) = Z(1)Z(2). \tag{94} $$
10. Elasticity of polymers. The thermodynamic identity for a one-dimensional
system is

$$ \tau\, d\sigma = dU - f\, dl \tag{95} $$

where $f$ is the external force exerted on the line and $dl$ is the extension of
the line. By analogy with (32) we form the derivative to find

$$ -\frac{f}{\tau} = \left(\frac{\partial \sigma}{\partial l}\right)_U. \tag{96} $$

The direction of the force is opposite to the conventional direction of the
pressure.

We consider a polymeric chain of $N$ links each of length $\rho$, with each link
equally likely to be directed to the right and to the left. (a) Show that the
number of arrangements that give a head-to-tail length of $l = 2|s|\rho$ is

$$ g(N,-s) + g(N,s) = \frac{2\,N!}{(\tfrac12 N + s)!\,(\tfrac12 N - s)!}. \tag{97} $$

(b) For $|s| \ll N$ show that

$$ \sigma(l) = \log[2g(N,0)] - l^2/2N\rho^2. \tag{98} $$

(c) Show that the force at extension $l$ is

$$ f = l\tau/N\rho^2. \tag{99} $$

The force is proportional to the temperature. The force arises because the
polymer wants to curl up: the entropy is higher in a random coil than in an
uncoiled configuration. Warming a rubber band makes it contract; warming a
steel wire makes it expand. The theory of rubber elasticity is discussed by
H. M. James and E. Guth, Journal of Chemical Physics 11, 455 (1943); Journal
of Polymer Science 4, 153 (1949); see also L. R. G. Treloar, Physics of rubber
elasticity, Oxford, 1958.


11. One-dimensional gas. Consider an ideal gas of $N$ particles, each of mass
$M$, confined to a one-dimensional line of length $L$. Find the entropy at
temperature $\tau$. The particles have spin zero.
[We consider] the distribution of the energy $U$ among $N$ oscillators of frequency
$\nu$. If $U$ is viewed as divisible without limit, then an infinite number of
distributions are possible. We consider however—and this is the essential point 
of the whole calculation—U as made up of an entirely determined number of 
finite equal parts, and we make use of the natural constant $h = 6.55 \times 10^{-27}$
erg-sec. This constant when multiplied by the common frequency $\nu$ of the
oscillators gives the element of energy $\varepsilon$ in ergs. . . .

M. Planck 


PLANCK DISTRIBUTION FUNCTION 


The Planck distribution describes the spectrum of the electromagnetic radiation
in thermal equilibrium within a cavity. Approximately, it describes the emission
spectrum of the Sun or of metal heated by a welding torch. The Planck
distribution was the first application of quantum thermal physics. Thermal
electromagnetic radiation is often called black body radiation. The Planck
distribution also describes the thermal energy spectrum of lattice vibrations in
an elastic solid.

The word “mode” characterizes a particular oscillation amplitude pattern in
the cavity or in the solid. We shall always refer to $\omega = 2\pi f$ as the
frequency of the radiation. The characteristic feature of the radiation problem
is that a mode of oscillation of frequency $\omega$ may be excited only in units
of the quantum of energy $\hbar\omega$. The energy $\varepsilon_s$ of the state
with $s$ quanta in the mode is

$$ \varepsilon_s = s\hbar\omega, \tag{1} $$

where $s$ is zero or any positive integer (Figure 4.1). We omit the zero point
energy $\tfrac12\hbar\omega$.

These energies are the same as the energies of a quantum harmonic oscillator
of frequency $\omega$, but there is a difference between the concepts. A harmonic
oscillator is a localized oscillator, whereas the electric and magnetic energy of
an electromagnetic cavity mode is distributed throughout the interior of the
cavity (Figure 4.2). For both problems the energy eigenvalues are integral
multiples of $\hbar\omega$, and this is the reason for the similarity in the
thermal physics of the two problems. The language used to describe an excitation
is different: $s$ for the oscillator is called the quantum number, and $s$ for
the quantized electromagnetic mode is called the number of photons in the mode.

Figure 4.1 States of an oscillator that represents a mode of frequency $\omega$
of an electromagnetic field. When the oscillator is in the orbital of energy
$s\hbar\omega$, the state is equivalent to $s$ photons in the mode.

Figure 4.2 Representation in one dimension of two electromagnetic modes $a$ and
$b$, of frequency $\omega_a$ and $\omega_b$. The amplitude of the electromagnetic
field is suggested in the figures for one photon and two photon occupancy of each
mode.

We first calculate the thermal average of the number of photons in a mode,
when these photons are in thermal equilibrium with a reservoir at a temperature
$\tau$. The partition function (3.10) is the sum over the states (1):

$$ Z = \sum_{s=0}^{\infty} \exp(-s\hbar\omega/\tau). \tag{2} $$

This sum is of the form $\sum x^s$, with $x \equiv \exp(-\hbar\omega/\tau)$.
Because $x$ is smaller than 1, the infinite series may be summed and has the
value $1/(1 - x)$, whence

$$ Z = \frac{1}{1 - \exp(-\hbar\omega/\tau)}. \tag{3} $$




The probability that the system is in the state $s$ of energy $s\hbar\omega$ is
given by the Boltzmann factor:

$$ P(s) = \frac{\exp(-s\hbar\omega/\tau)}{Z}. \tag{4} $$

The thermal average value of $s$ is

$$ \langle s \rangle = \sum_{s=0}^{\infty} s P(s) = Z^{-1} \sum s \exp(-s\hbar\omega/\tau). \tag{5} $$

With $y \equiv \hbar\omega/\tau$, the summation on the right-hand side has the form:

$$ \sum s \exp(-sy) = -\frac{d}{dy} \sum \exp(-sy)
   = -\frac{d}{dy}\left(\frac{1}{1 - \exp(-y)}\right)
   = \frac{\exp(-y)}{[1 - \exp(-y)]^2}. $$

From (3) and (5) we find

$$ \langle s \rangle = \frac{\exp(-y)}{1 - \exp(-y)}, $$

or

$$ \langle s \rangle = \frac{1}{\exp(\hbar\omega/\tau) - 1}. \tag{6} $$

This is the Planck distribution function for the thermal average number of
photons (Figure 4.3) in a single mode of frequency $\omega$. Equally, it is the
average number of phonons in the mode. The result applies to any kind of wave
field with energy in the form of (1).
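
As a numerical check on the closed form (6), the sketch below compares it with a direct, truncated evaluation of the sums in (2) and (5). The function names and the cutoff `s_max` are illustrative choices, not part of the text; the variable `x` stands for $\hbar\omega/\tau$.

```python
import math

def planck_occupancy(x):
    """Closed form (6): <s> = 1/(exp(x) - 1), with x = hbar*omega/tau."""
    return 1.0 / math.expm1(x)

def occupancy_by_direct_sum(x, s_max=2000):
    """Evaluate <s> = sum_s s*exp(-s*x) / sum_s exp(-s*x), truncated at s_max.

    The cutoff s_max is an assumption of this sketch; it must be large
    enough that exp(-s_max * x) is negligible.
    """
    weights = [math.exp(-s * x) for s in range(s_max + 1)]
    z = sum(weights)  # truncated partition function (2)
    return sum(s * w for s, w in enumerate(weights)) / z

for x in (0.1, 1.0, 5.0):
    print(x, planck_occupancy(x), occupancy_by_direct_sum(x))
```

The two columns should agree to many digits as long as the truncated terms are negligible, which is the case for the values of `x` used here.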


PLANCK LAW AND STEFAN-BOLTZMANN LAW 
The thermal average energy in the mode is

$$ \langle \varepsilon \rangle = \langle s \rangle \hbar\omega
   = \frac{\hbar\omega}{\exp(\hbar\omega/\tau) - 1}. \tag{7} $$
Figure 4.3 Planck distribution as a function of the reduced temperature
$\tau/\hbar\omega$ (horizontal axis). Here $\langle s(\omega) \rangle$ is the
thermal average of the number of photons in the mode of frequency $\omega$. A
plot of $\langle s(\omega) \rangle + \tfrac12$ is also given, where $\tfrac12$ is
the effective zero point occupancy of the mode; the dashed line is the classical
asymptote. Note that we write

$$ \langle s \rangle + \tfrac12 = \tfrac12 \coth(\hbar\omega/2\tau). $$


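The high temperature limit $\tau \gg \hbar\omega$ is often called the classical
limit. Here $\exp(\hbar\omega/\tau)$ may be approximated as
$1 + \hbar\omega/\tau + \cdots$, so that the denominator of (7) reduces to
$\hbar\omega/\tau$, whence the classical average energy is

$$ \langle \varepsilon \rangle \simeq \tau. \tag{8} $$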
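
The approach to the classical limit can be seen numerically. The following sketch, with an illustrative function name not taken from the text, evaluates the average energy (7) in units of $\tau$ as a function of $x = \hbar\omega/\tau$; the ratio tends to 1 for $x \ll 1$ and is exponentially small for $x \gg 1$.

```python
import math

def mean_energy_over_tau(x):
    """<epsilon>/tau from (7), written as x/(exp(x) - 1) with x = hbar*omega/tau."""
    return x / math.expm1(x)

# Small x is the classical regime; large x is the quantum regime.
for x in (1e-3, 0.1, 1.0, 10.0):
    print(x, mean_energy_over_tau(x))
```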
There is an infinite number of electromagnetic modes within any cavity. Each
mode $n$ has its own frequency $\omega_n$. For radiation confined within a
perfectly conducting cavity in the form of a cube of edge $L$, there is a set of
modes of the form

$$ E_x = E_{x0} \sin\omega t \, \cos(n_x \pi x/L) \sin(n_y \pi y/L) \sin(n_z \pi z/L), \tag{9a} $$
$$ E_y = E_{y0} \sin\omega t \, \sin(n_x \pi x/L) \cos(n_y \pi y/L) \sin(n_z \pi z/L), \tag{9b} $$
$$ E_z = E_{z0} \sin\omega t \, \sin(n_x \pi x/L) \sin(n_y \pi y/L) \cos(n_z \pi z/L). \tag{9c} $$


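Here $E_x$, $E_y$ and $E_z$ are the three electric field components, and
$E_{x0}$, $E_{y0}$ and $E_{z0}$ are the corresponding amplitudes. The three
components are not independent, because the field must be divergence-free:

$$ \operatorname{div} \mathbf{E} = \frac{\partial E_x}{\partial x}
   + \frac{\partial E_y}{\partial y} + \frac{\partial E_z}{\partial z} = 0. \tag{10} $$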



When we insert (9) into (10) and drop all common factors, we find the condition

$$ E_{x0} n_x + E_{y0} n_y + E_{z0} n_z = \mathbf{E}_0 \cdot \mathbf{n} = 0. \tag{11} $$

This states that the field vectors must be perpendicular to the vector
$\mathbf{n}$ with the components $n_x$, $n_y$ and $n_z$, so that the
electromagnetic field in the cavity is a transversely polarized field. The
polarization direction is defined as the direction of $\mathbf{E}_0$.

For a given triplet $n_x, n_y, n_z$ we can choose two mutually perpendicular
polarization directions, so that there are two distinct modes for each triplet
$n_x, n_y, n_z$.

On substitution of (9) in the wave equation

$$ c^2 \left( \frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2}
   + \frac{\partial^2}{\partial z^2} \right) E_z = \frac{\partial^2 E_z}{\partial t^2}, \tag{12} $$

with $c$ the velocity of light, we find

$$ c^2 \pi^2 (n_x^2 + n_y^2 + n_z^2) = \omega^2 L^2. \tag{13} $$

This determines the frequency $\omega$ of the mode in terms of the triplet of
integers $n_x, n_y, n_z$. If we define

$$ n \equiv (n_x^2 + n_y^2 + n_z^2)^{1/2}, \tag{14} $$

then the frequencies are of the form

$$ \omega_n = n\pi c/L. \tag{15} $$


The total energy of the photons in the cavity is, from (7),

$$ U = \sum_n \langle \varepsilon_n \rangle
   = \sum_n \frac{\hbar\omega_n}{\exp(\hbar\omega_n/\tau) - 1}. \tag{16} $$

The sum is over the triplet of integers $n_x, n_y, n_z$. Positive integers alone
will describe all independent modes of the form (9). We replace the sum over
$n_x, n_y, n_z$ by an integral over the volume element $dn_x\, dn_y\, dn_z$ in the
space of the mode indices. That is, we set

$$ \sum_n (\cdots) = \tfrac18 \int_0^\infty 4\pi n^2\, dn\, (\cdots), \tag{17} $$


Chapter 4: Thermal Radiation and Planck Distribution 


where the factor $\tfrac18 = (\tfrac12)^3$ arises because only the positive
octant of the space is involved. We now multiply the sum or integral by a factor
of 2 because there are two independent polarizations of the electromagnetic
field (two independent sets of cavity modes). Thus

$$ U = \pi \int_0^\infty dn\, n^2 \frac{\hbar\omega_n}{\exp(\hbar\omega_n/\tau) - 1}
   = (\pi^2 \hbar c/L) \int_0^\infty dn\, n^3 \frac{1}{\exp(\hbar c n \pi/L\tau) - 1}, \tag{18} $$

with (15) for $\omega_n$. Standard practice is to transform the definite integral
to one over a dimensionless variable. We set $x \equiv \pi\hbar c n/L\tau$, and
(18) becomes


$$ U = (\pi^2 \hbar c/L)(\tau L/\pi\hbar c)^4 \int_0^\infty dx\, \frac{x^3}{\exp x - 1}. \tag{19} $$

The definite integral has the value $\pi^4/15$; it is found in good standard
tables such as Dwight (cited in the general references). The energy per unit
volume is

$$ \frac{U}{V} = \frac{\pi^2}{15\hbar^3 c^3}\, \tau^4, \tag{20} $$

with the volume $V = L^3$. The result that the radiant energy density is
proportional to the fourth power of the temperature is known as the
Stefan-Boltzmann law of radiation.
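
The value $\pi^4/15$ quoted for the definite integral in (19) can be checked by direct quadrature. The sketch below uses a hand-written composite Simpson rule and a finite upper cutoff in place of infinity; the cutoff of 60 and the number of subintervals are assumptions of the sketch, chosen so that the neglected tail is far below the quadrature error.

```python
import math

def planck_integrand(x):
    """x^3/(exp(x) - 1); the limit as x -> 0 is 0."""
    return 0.0 if x == 0.0 else x**3 / math.expm1(x)

def simpson(f, a, b, n=4000):
    """Composite Simpson rule with an even number n of subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3.0

# The integrand is negligible beyond x ~ 50, so a finite cutoff stands in for infinity.
numeric = simpson(planck_integrand, 0.0, 60.0)
exact = math.pi**4 / 15.0
print(numeric, exact)
```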

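For many applications of this theory we decompose (20) into the spectral
density of the radiation. The spectral density is defined as the energy per unit
volume per unit frequency range, and is denoted as $u_\omega$. We can find
$u_\omega$ from (18) rewritten in terms of $\omega$:

$$ U/V = \int d\omega\, u_\omega
   = \frac{\hbar}{\pi^2 c^3} \int d\omega\, \frac{\omega^3}{\exp(\hbar\omega/\tau) - 1}, \tag{21} $$

so that the spectral density is

$$ u_\omega = \frac{\hbar}{\pi^2 c^3}\, \frac{\omega^3}{\exp(\hbar\omega/\tau) - 1}. \tag{22} $$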


Figure 4.4 Plot of $x^3/(e^x - 1)$ with $x \equiv \hbar\omega/\tau$. This
function is involved in the Planck radiation law for the spectral density
$u_\omega$. The temperature of a black body may be found from the frequency
$\omega_{\max}$ at which the radiant energy density is a maximum, per unit
frequency range. This frequency is directly proportional to the temperature.


This result is the Planck radiation law; it gives the frequency distribution of
thermal radiation (Figure 4.4). Quantum theory began here.

The entropy of the thermal photons can be found from the relation (3.34a)
at constant volume: $d\sigma = dU/\tau$, whence from (20),

$$ d\sigma = \frac{4\pi^2 V}{15\hbar^3 c^3}\, \tau^2\, d\tau. $$

Thus the entropy is

$$ \sigma(\tau) = (4\pi^2 V/45)(\tau/\hbar c)^3. \tag{23} $$

The constant of integration is zero, from (3.55) and the relation between $F$
and $\sigma$.
A process carried out at constant photon entropy will have $V\tau^3 = \text{constant}$.

The measurement of high temperatures depends on the flux of radiant energy
from a small hole in the wall of a cavity maintained at the temperature of
interest. Such a hole is said to radiate as a black body, which means that the
radiation emission is characteristic of a thermal equilibrium distribution. The
energy flux density $J_U$ is defined as the rate of energy emission per unit area.
The flux density is of the order of the energy contained in a column of unit
area and length equal to the velocity of light times the unit of time. Thus,

$$ J_U = [cU(\tau)/V] \times (\text{geometrical factor}). \tag{24} $$


The geometrical factor is equal to $\tfrac14$; the derivation is the subject of
Problem 15. The final result for the radiant energy flux is

$$ J_U = \frac{cU(\tau)}{4V} = \frac{\pi^2 \tau^4}{60\hbar^3 c^2}, \tag{25} $$

by use of (20) for the energy density $U/V$. The result is often written as

$$ J_U = \sigma_B T^4; \tag{26} $$

the Stefan-Boltzmann constant

$$ \sigma_B \equiv \pi^2 k_B^4/60\hbar^3 c^2 \tag{26a} $$

has the value $5.670 \times 10^{-8}\ \mathrm{W\,m^{-2}\,K^{-4}}$ or
$5.670 \times 10^{-5}\ \mathrm{erg\,cm^{-2}\,s^{-1}\,K^{-4}}$. (Here $\sigma_B$ is
not the entropy.) A body that radiates at this rate is said to radiate as a black
body. A small hole in a cavity whose walls are in thermal equilibrium at
temperature $T$ will radiate as a black body at the rate given in (26). The rate
is independent of the physical constitution of the walls of the cavity and
depends only on the temperature.
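
The numerical value of the Stefan-Boltzmann constant can be reproduced from the expression (26a). The sketch below is illustrative only: the SI values of $k_B$, $\hbar$ and $c$ are supplied here as assumptions (they are not given in the text), and the function names are hypothetical.

```python
import math

# SI values of the constants (assumed here; not taken from the text).
K_B = 1.380649e-23        # Boltzmann constant, J/K
HBAR = 1.054571817e-34    # reduced Planck constant, J s
C = 2.99792458e8          # speed of light, m/s

def stefan_boltzmann_constant():
    """sigma_B = pi^2 k_B^4 / (60 hbar^3 c^2), as in (26a), in W m^-2 K^-4."""
    return math.pi**2 * K_B**4 / (60.0 * HBAR**3 * C**2)

def blackbody_flux(temperature_kelvin):
    """Radiant energy flux density J_U = sigma_B T^4 from (26), in W/m^2."""
    return stefan_boltzmann_constant() * temperature_kelvin**4

print(stefan_boltzmann_constant())  # close to the quoted 5.670e-8 W m^-2 K^-4
print(blackbody_flux(300.0))        # flux from a black body near room temperature
```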


Emission and Absorption: Kirchhoff Law 


The ability of a surface to emit radiation is proportional to the ability of the
surface to absorb radiation. We demonstrate this relation, first for a black body
or black surface and, second, for a surface with arbitrary properties. An object
is defined to be black in a given frequency range if all electromagnetic radiation
incident upon it in that range is absorbed. By this definition a hole in a cavity is
black if the hole is small enough that radiation incident through the hole will
reflect enough times from the cavity walls to be absorbed in the cavity with
negligible loss back through the hole.

The radiant energy flux density $J_U$ from a black surface at temperature $\tau$
is equal to the radiant energy flux density $J_U$ emitted from a small hole in a
cavity at the same temperature. To prove this, let us close the hole with the
black surface, hereafter called the object. In thermal equilibrium the thermal
average energy flux from the black object to the interior of the cavity must be
equal, but opposite, to the thermal average energy flux from the cavity to the
black object.

We prove the following: If a non-black object at temperature $\tau$ absorbs a
fraction $a$ of the radiation incident upon it, the radiation flux emitted by the
object will be $a$ times the radiation flux emitted by a black body at the same
temperature. Let $a$ denote the absorptivity and $e$ the emissivity, where the
emissivity is defined so that the radiation flux emitted by the object is $e$
times the flux emitted by a black body at the same temperature. The object must
emit at the same rate as it absorbs if equilibrium is to be maintained. It
follows that $a = e$. This is the Kirchhoff law. For the special case of a
perfect reflector, $a$ is zero, whence $e$ is zero. A perfect reflector does not
radiate.

The arguments can be generalized to apply to the radiation at any frequency,
as between $\omega$ and $\omega + d\omega$. We insert a filter between the object
and the hole in the black body. Let the filter reflect perfectly outside this
frequency range, and let it transmit perfectly within this range. The flux
equality arguments now apply to the transmitted spectral band, so that
$a(\omega) = e(\omega)$ for any surface in thermal equilibrium.


Estimation of Surface Temperature 


One way to estimate the surface temperature of a hot body such as a star is
from the frequency at which the maximum emission of radiant energy takes
place (see Figure 4.4). What this frequency is depends on whether we look at the
energy flux per unit frequency range or per unit wavelength range. For
$u_\omega$, the energy density per unit frequency range, the maximum is given
from the Planck law, Eq. (22), as

$$ \frac{d}{dx}\left(\frac{x^3}{\exp x - 1}\right) = 0, $$

or

$$ 3 - 3\exp(-x) = x. $$

This equation may be solved numerically. The root is

$$ \hbar\omega_{\max}/k_B T = x_{\max} \simeq 2.82, \tag{27} $$

as in Figure 4.4.
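
The numerical solution mentioned above can be sketched with a few lines of bisection. The bracket $[1, 5]$ and the function names are choices made for this illustration; the bracket deliberately excludes the trivial root at $x = 0$.

```python
import math

def peak_condition(x):
    """Vanishes where x^3/(exp(x) - 1) is stationary: 3 - 3*exp(-x) - x."""
    return 3.0 - 3.0 * math.exp(-x) - x

def bisect(f, lo, hi, tol=1e-12):
    """Plain bisection; assumes f(lo) and f(hi) have opposite signs."""
    f_lo = f(lo)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = f(mid)
        if (f_mid > 0) == (f_lo > 0):
            lo, f_lo = mid, f_mid   # root lies in the upper half
        else:
            hi = mid                # root lies in the lower half
    return 0.5 * (lo + hi)

# x = 0 is a trivial root, so the bracket starts well away from it.
x_max = bisect(peak_condition, 1.0, 5.0)
print(x_max)
```

The printed root can be compared with the value 2.82 quoted in (27).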




Example: Cosmic black body background radiation. A major recent discovery is that the
universe accessible to us is filled with radiation approximately like that of a black body
at 2.9 K. The existence of this radiation (Figure 4.5) is important evidence for big bang
cosmological models which assume that the universe is expanding and cooling with time. 
This radiation is left over from an early epoch when the universe was composed primarily 
of electrons and protons at a temperature of about 4000 K. The plasma of electrons and 
protons interacted strongly with electromagnetic radiation at all important frequencies, 
so that the matter and the black body radiation were in thermal equilibrium. By the time 
the universe had cooled to 3000 K, the matter was primarily in the form of atomic hydrogen. 
This interacts with black body radiation only at the frequencies of the hydrogen spectral 
lines. Most of the black body radiation energy thus was effectively decoupled from the 
matter. Thereafter the radiation evolved with time in a very simple way: the photon gas 
was cooled by expansion at constant entropy to a temperature of 2.9 K. The photon gas will 
remain at constant entropy if the frequency of each mode is lowered during the expansion 
of the universe with the number of photons in each mode kept constant. We show in (58) 
below that the entropy is constant if the number of photons in each mode is constant—the 
occupancies determine the entropy. 

After the decoupling the evolution of matter into heavier atoms (which are organized 
into galaxies, stars, and dust clouds) was more complicated than before decoupling. 
Electromagnetic radiation, such as starlight, radiated by the matter since the decoupling 
is superimposed on the cosmic black body radiation. 




ELECTRICAL NOISE 


As an important example of the Planck law in one dimension, we consider the
spontaneous thermal fluctuations in voltage across a resistor. These fluctuations,
which are called noise, were discovered by J. B. Johnson and explained by
H. Nyquist.* The characteristic property of Johnson noise is that the
mean-square noise voltage is proportional to the value of the resistance $R$, as
shown by Figure 4.6. We shall see that $\langle V^2 \rangle$ is also directly
proportional to the temperature $\tau$ and the bandwidth $\Delta f$ of the
circuit. (This section presumes a knowledge of electromagnetic wave propagation
at the intermediate level.)

* H. Nyquist, Phys. Rev. 32, 110 (1928); a deeper discussion is given by
C. Kittel, Elementary statistical physics, Wiley, 1958, Sections 27-30.

Figure 4.5 Experimental measurements of the spectrum of the cosmic black
body radiation. Observations of the flux were made with microwave heterodyne
receivers at frequencies below the peak, were deduced from optical measurements
of the spectrum of interstellar CN molecules near the peak, and were measured
with a balloon-borne infrared spectrometer at frequencies above the peak.
Courtesy of P. L. Richards.

The Nyquist theorem gives a quantitative expression for the thermal noise
voltage generated by a resistor in thermal equilibrium. The theorem is therefore


needed in any estimate of the limiting signal-to-noise ratio of an experimental
apparatus. In the original form the Nyquist theorem states that the mean
square voltage across a resistor of resistance $R$ in thermal equilibrium at
temperature $\tau$ is given by

$$ \langle V^2 \rangle = 4R\tau\, \Delta f, \tag{28} $$

where $\Delta f$ is the frequency* bandwidth within which the voltage
fluctuations are measured; all frequency components outside the given range are
ignored. We show below that the thermal noise power per unit frequency range
delivered by a resistor to a matched load is $\tau$; the factor 4 enters where it
does because in the circuit of Figure 4.7, the power delivered to an arbitrary
resistive load $R'$ is

$$ \langle I^2 \rangle R' = \frac{\langle V^2 \rangle R'}{(R + R')^2}, \tag{29} $$

which at match ($R' = R$) is $\langle V^2 \rangle/4R$.

* In this section the word frequency refers to cycles per unit time, and not to
radians per unit time.

Figure 4.6 Voltage squared versus resistance for various kinds of conductors,
including electrolytes. The horizontal axis is the resistance component, in
M$\Omega$. The conductors plotted are a carbon filament, Advance wire, and
solutions of CuSO$_4$, NaCl, K$_2$CrO$_4$, and Ca(NO$_3$)$_2$ in H$_2$O. After
J. B. Johnson.
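
To give a feeling for the magnitudes in (28), the sketch below evaluates the root-mean-square noise voltage in conventional units, using $\tau = k_B T$. The resistance, temperature and bandwidth in the example are hypothetical values chosen only for illustration, and the function names are not from the text.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant in J/K (SI value, assumed; the text uses tau = k_B * T)

def johnson_noise_rms(resistance_ohm, temperature_kelvin, bandwidth_hz):
    """RMS noise voltage from (28): <V^2> = 4 R tau (Delta f), with tau = k_B T."""
    tau = K_B * temperature_kelvin
    return math.sqrt(4.0 * resistance_ohm * tau * bandwidth_hz)

def matched_load_power(resistance_ohm, temperature_kelvin, bandwidth_hz):
    """Power into a matched load, <V^2>/4R, which reduces to tau * (Delta f)."""
    v_squared = johnson_noise_rms(resistance_ohm, temperature_kelvin, bandwidth_hz) ** 2
    return v_squared / (4.0 * resistance_ohm)

# Hypothetical operating point: 0.5 megohm at 300 K in a 10 kHz band.
print(johnson_noise_rms(0.5e6, 300.0, 1.0e4))   # volts, of order 10 microvolts
print(matched_load_power(0.5e6, 300.0, 1.0e4))  # watts, independent of R
```

The second function makes the point of the paragraph above explicit: the power delivered to a matched load does not depend on the resistance.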


Figure 4.7 Equivalent circuit for a resistance $R$ with a generator of thermal
noise that delivers power to a load $R'$. The current

$$ I = \frac{V}{R + R'}, $$

so that the mean power dissipated in the load is

$$ \mathcal{P} = \langle I^2 \rangle R' = \frac{\langle V^2 \rangle R'}{(R' + R)^2}, $$

which is a maximum with respect to $R'$ when $R' = R$. In this condition the load
is said to be matched to the power supply. At match,
$\mathcal{P} = \langle V^2 \rangle/4R$. The filter enables us to limit the
frequency bandwidth under consideration; that is, the bandwidth to which the mean
square voltage fluctuation applies.


Consider as in Figure 4.8 a lossless transmission line of length $L$ and
characteristic impedance $Z_c = R$ terminated at each end by a resistance $R$.
Thus the line is matched at each end, in the sense that all energy traveling down
the line will be absorbed without reflection in the appropriate resistance. The
entire circuit is maintained at temperature $\tau$.

A transmission line is essentially an electromagnetic system in one dimension.
We follow the argument given above for the distribution of photons in thermal
equilibrium, but now in a space of one dimension instead of three dimensions.
The transmission line has two photon modes (one propagating in each direction)
of frequency $2\pi f_n = 2\pi n c'/L$ from (15), so that there are two modes in
the frequency range

$$ \delta f = c'/L, \tag{30} $$

where $c'$ is the propagation velocity on the line. Each mode has energy

$$ \frac{\hbar\omega}{\exp(\hbar\omega/\tau) - 1} \tag{31} $$

in equilibrium, according to the Planck distribution. We are usually concerned
with circuits in the classical limit $\hbar\omega \ll \tau$ so that the thermal
energy per mode is $\tau$. It follows that the energy on the line in the
frequency range $\Delta f$ is

$$ 2\tau\, \Delta f/\delta f = 2\tau L\, \Delta f/c'. \tag{32} $$

Half of this energy travels in each direction and leaves the line in the transit
time $L/c'$, so the rate at which energy comes off the line in one direction is

$$ \tau\, \Delta f. \tag{33} $$

Figure 4.8 Transmission line of length $L$ with matched terminations, as
conceived for the derivation of the Nyquist theorem. The characteristic impedance
$Z_c$ of the transmission line has the value $R$. According to the fundamental
theorem of transmission lines, the terminal resistors are matched to the line
when their resistance has the same value $R$.


The power coming off the line at one end is all absorbed in the terminal
impedance $R$ at that end; there are no reflections when the terminal impedance
is matched to the line. In thermal equilibrium the load must emit energy to the
line at the same rate, or else its temperature would rise. Thus the power input
to the load is

$$ \mathcal{P} = \langle I^2 \rangle R = \tau\, \Delta f, \tag{34} $$

but $V = 2RI$, so that (28) is obtained. The result has been used in low
temperature thermometry, in temperature regions (Figure 4.9) where it is more
convenient to measure $\langle V^2 \rangle$ than $\tau$. Johnson noise is the
noise across a resistor when no dc current is flowing. Additional noise (not
discussed here) appears when a dc current flows.


PHONONS IN SOLIDS: DEBYE THEORY 


So I decided to calculate the spectral distribution of the possible free vibrations 
for a continuous solid and to consider this distribution as a good enough 
approximation to the actual distribution. The sonic spectrum of a lattice must, 
of course, deviate from this as soon as the wavelength becomes comparable to
the distances of the atoms. . . . The only thing which had to be done was to
adjust to the fact that every solid of finite dimensions contains a finite number
of atoms and therefore has a finite number of free vibrations. . . . At low enough
temperatures, and in perfect analogy to the radiation law of
Stefan-Boltzmann . . . , the vibrational energy content of a solid will be
proportional to $T^4$.

P. Debye 


The energy of an elastic wave in a solid is quantized just as the energy of an
electromagnetic wave in a cavity is quantized. The quantum of energy of an
elastic wave is called a phonon. The thermal average number of phonons in an
elastic wave of frequency $\omega$ is given by the Planck distribution function,
just as for photons:

$$ \langle s(\omega) \rangle = \frac{1}{\exp(\hbar\omega/\tau) - 1}. \tag{35} $$


We assume that the frequency of an elastic wave is independent of the amplitude
of the elastic strain. We want to find the energy and heat capacity of the elastic
waves in solids. Several of the results obtained for photons may be carried
over to phonons. The results are simple if we assume that the velocities of all
elastic waves are equal, independent of frequency, direction of propagation,
and direction of polarization. This assumption is not very accurate, but it helps
account for the general trend of the observed results in many solids, with a
minimum of computation.

There are two important features of the experimental results: the heat capacity
of a nonmetallic solid varies as $\tau^3$ at low temperatures, and at high
temperatures the heat capacity is independent of the temperature. In metals there
is an extra contribution from the conduction electrons, treated in Chapter 7.


Number of Phonon Modes 


There is no limit to the number of possible electromagnetic modes in a cavity,
but the number of elastic modes in a finite solid is bounded. If the solid consists
of $N$ atoms, each with three degrees of freedom, the total number of modes is
$3N$. An elastic wave has three possible polarizations, two transverse and one
longitudinal, in contrast to the two possible polarizations of an electromagnetic
wave. In a transverse elastic wave the displacement of the atoms is perpendicular
to the propagation direction of the wave; in a longitudinal wave the displacement
is parallel to the propagation direction. The sum of a quantity over all
modes may be written as, including the factor 3,

$$ \sum_n (\cdots) = \tfrac38 \int 4\pi n^2\, dn\, (\cdots), \tag{36} $$

by extension of (17). Here $n$ is defined in terms of the triplet of integers
$n_x, n_y, n_z$, exactly as for photons. We want to find $n_{\max}$ such that the
total number of elastic modes is equal to $3N$:

$$ \tfrac38 \int_0^{n_{\max}} 4\pi n^2\, dn = 3N. \tag{37} $$

In the photon problem there was no corresponding limitation on the total
number of modes. It is customary to write $n_D$, after Debye, for $n_{\max}$.
Then (37) becomes

$$ \tfrac12 \pi n_D^3 = 3N; \qquad n_D = (6N/\pi)^{1/3}. \tag{38} $$
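The thermal energy of the phonons is, from (16),

$$ U = \sum \langle \varepsilon_n \rangle = \sum \langle s_n \rangle \hbar\omega_n
   = \sum \frac{\hbar\omega_n}{\exp(\hbar\omega_n/\tau) - 1}, \tag{39} $$

or, by (36) and (38),

$$ U = \frac{3\pi}{2} \int_0^{n_D} dn\, n^2 \frac{\hbar\omega_n}{\exp(\hbar\omega_n/\tau) - 1}. \tag{40} $$

By analogy with the evaluation of (19), with the velocity of sound $v$ written in
place of the velocity of light $c$,

$$ U = (3\pi^2 \hbar v/2L)(\tau L/\pi\hbar v)^4 \int_0^{x_D} dx\, \frac{x^3}{\exp x - 1}, \tag{41} $$

where $x \equiv \pi\hbar v n/L\tau$. For $L^3$ we write the volume $V$. Here,
with (38), the upper limit of integration is

$$ x_D = \pi\hbar v n_D/L\tau = \hbar v (6\pi^2 N/V)^{1/3}/\tau, \tag{42} $$

usually written as

$$ x_D = \theta/T = k_B\theta/\tau, \tag{43} $$

where $\theta$ is called the Debye temperature:

$$ \theta = (\hbar v/k_B)(6\pi^2 N/V)^{1/3}. \tag{44} $$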

The result (41) for the energy is of special interest at low temperatures such
that $T \ll \theta$. Here the limit $x_D$ on the integral is much larger than
unity, and $x_D$ may be replaced by infinity. We note from Figure 4.4 that there
is little contribution to the integrand out beyond $x = 10$. For the definite
integral we have

$$ \int_0^\infty dx\, \frac{x^3}{\exp x - 1} = \frac{\pi^4}{15}, \tag{45} $$

as earlier. Thus the energy in the low temperature limit is

$$ U(T) \cong \frac{3\pi^4 N \tau^4}{5(k_B\theta)^3} \cong \frac{3\pi^4 N k_B T^4}{5\theta^3}, \tag{46} $$

proportional to $T^4$. The heat capacity is, for $\tau \ll k_B\theta$ or
$T \ll \theta$,

$$ C_V = \left(\frac{\partial U}{\partial \tau}\right)_V
   = \frac{12\pi^4 N}{5} \left(\frac{\tau}{k_B\theta}\right)^3. \tag{47a} $$
In conventional units,

$$ C_V = \left(\frac{\partial U}{\partial T}\right)_V
   = \frac{12\pi^4 N k_B}{5} \left(\frac{T}{\theta}\right)^3. \tag{47b} $$

This result is known as the Debye $T^3$ law.* Experimental results for argon are
plotted in Figure 4.10. Representative experimental values of the Debye
temperature are given in Table 4.1. The calculated variation of $C_V$ versus
$T/\theta$ is plotted in Figure 4.11. The high temperature limit $T \gg \theta$
is the subject of Problem 11. Several related thermodynamic functions for a Debye
solid are given in Table 4.2 and are plotted in Figure 4.12.

* P. Debye, Annalen der Physik 39, 789 (1912); M. Born and T. v. Kármán,
Physikalische Zeitschrift 13, 297 (1912); 14, 65 (1913).

Figure 4.10 Low temperature heat capacity of solid argon, plotted against
$T^3$ to show the excellent agreement with the Debye $T^3$ law. The value of
$\theta$ from these data is 92 K. Courtesy of L. Finegold and N. E. Phillips.
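
The crossover between the $T^3$ law and the high temperature limit can be explored numerically. The sketch below assumes the standard closed expression obtained by differentiating the Debye energy (41) with respect to temperature at fixed $\theta$, namely $C_V/Nk_B = 9(T/\theta)^3 \int_0^{\theta/T} x^4 e^x/(e^x - 1)^2\, dx$; this formula is not written out in the text and is quoted here as an assumption of the sketch. The function names and the Simpson quadrature are illustrative choices.

```python
import math

def debye_integrand(x):
    """x^4 e^x/(e^x - 1)^2, written with exp(-x) to avoid overflow; tends to x^2 as x -> 0."""
    if x == 0.0:
        return 0.0
    return x**4 * math.exp(-x) / math.expm1(-x) ** 2

def simpson(f, a, b, n=2000):
    """Composite Simpson rule with an even number n of subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3.0

def debye_heat_capacity(t_over_theta):
    """C_V/(N k_B) as a function of T/theta (assumed standard Debye expression)."""
    x_d = 1.0 / t_over_theta
    return 9.0 * t_over_theta**3 * simpson(debye_integrand, 0.0, x_d)

def t_cubed_law(t_over_theta):
    """Low temperature limit (47b), divided by N k_B."""
    return 12.0 * math.pi**4 / 5.0 * t_over_theta**3

for t in (0.02, 0.1, 1.0, 10.0):
    print(t, debye_heat_capacity(t), t_cubed_law(t))
```

At small $T/\theta$ the two columns should coincide; at large $T/\theta$ the first column should level off near 3, which corresponds to $3Nk_B$ and to the asymptote of about 24.9 J mol$^{-1}$ K$^{-1}$ for one mole.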


Table 4.1 Debye temperature $\theta_0$ in K. [Periodic-table layout of values by
element; the individual entries are not recoverable from this copy.]
NOTE: The subscript zero on the $\theta$ denotes the low temperature limit of the
experimental values.


Figure 4.11 Heat capacity $C_V$ of a solid, according to the Debye
approximation. The vertical scale is in J mol$^{-1}$ K$^{-1}$. The horizontal
scale is the temperature normalized to the Debye temperature $\theta$. The region
of the $T^3$ law is below $0.1\theta$. The asymptotic value at high values of
$T/\theta$ is 24.943 J mol$^{-1}$ K$^{-1}$.
2     20.59    18.30    55       −3.64
3     16.53    10.71    2.36     −1.21
5     9.20     4.08     0.58     −0.23
6     6.23     2.64     0.323    −0.118
7     4.76     1.77     0.187    −0.066
8     3.45     1.22     0.114    −0.039
9     2.53     0.874    0.073    −0.025
10    1.891    0.643    0.048    −0.016
Figure 4.12 Energy $U$ and free energy $F \equiv U - \tau\sigma$ of a solid,
according to the Debye theory. The Debye temperature of the solid is $\theta$.


SUMMARY 


1. The Planck distribution function is

$$ \langle s \rangle = \frac{1}{\exp(\hbar\omega/\tau) - 1} $$

for the thermal average number of photons in a cavity mode of frequency $\omega$.


2. The Stefan-Boltzmann law is

$$ \frac{U}{V} = \frac{\pi^2}{15\hbar^3 c^3}\, \tau^4, $$

for the radiant energy density in a cavity at temperature $\tau$.


3. The Planck radiation law is

$$ u_\omega = \frac{\hbar}{\pi^2 c^3}\, \frac{\omega^3}{\exp(\hbar\omega/\tau) - 1}, $$

for the radiation energy per unit volume per unit range of frequency.


4. The flux density of radiant energy is $J_U = \sigma_B T^4$, where $\sigma_B$
is the Stefan-Boltzmann constant $\pi^2 k_B^4/60\hbar^3 c^2$.


5. The Debye low temperature limit of the heat capacity of a dielectric solid
is, in conventional units,

$$ C_V = \frac{12\pi^4 N k_B}{5} \left(\frac{T}{\theta}\right)^3, $$

where the Debye temperature

$$ \theta = (\hbar v/k_B)(6\pi^2 N/V)^{1/3}. $$


PROBLEMS 


1. Number of thermal photons. Show that the number of photons
$\sum \langle s_n \rangle$ in equilibrium at temperature $\tau$ in a cavity of
volume $V$ is

$$ N = 2.404\, \pi^{-2} V (\tau/\hbar c)^3. \tag{48} $$

From (23) the entropy is $\sigma = (4\pi^2 V/45)(\tau/\hbar c)^3$, whence
$\sigma/N \simeq 3.602$. It is believed that the total number of photons in the
universe is $10^8$ larger than the total number of nucleons (protons, neutrons).
Because both entropies are of the order of the respective number of particles
(see Eq. 3.76), the photons provide the dominant contribution to the entropy of
the universe, although the particles dominate the total energy. We believe that
the entropy of the photons is essentially constant, so that the entropy of the
universe is approximately constant with time.
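
The two numbers quoted in this problem, 2.404 and 3.602, can be checked with a short computation. The sketch below relies on the standard result that $\int_0^\infty x^2/(e^x - 1)\, dx = 2\zeta(3)$, which is not derived in the text and is used here as an assumption; the function name and the number of terms are illustrative.

```python
import math

def zeta3(terms=200000):
    """Riemann zeta(3) by direct summation; the neglected tail is about 1/(2*terms^2)."""
    return sum(1.0 / k**3 for k in range(1, terms + 1))

# Coefficient in (48): N = number_coefficient * V (tau/hbar c)^3 / pi^2.
number_coefficient = 2.0 * zeta3()
# Coefficient in (23): sigma = entropy_coefficient * V (tau/hbar c)^3.
entropy_coefficient = 4.0 * math.pi**2 / 45.0

entropy_per_photon = entropy_coefficient / (number_coefficient / math.pi**2)
print(number_coefficient, entropy_per_photon)
```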


2. Surface temperature of the Sun. The value of the total radiant energy flux 
density at the Earth from the Sun normal to the incident rays is called the solar 
constant of the Earth. The observed value integrated over all emission wave- 
lengths and referred to the mean Earth-Sun distance is: 


$$\text{solar constant} = 0.136\ \mathrm{J\,s^{-1}\,cm^{-2}} . \tag{49}$$


(a) Show that the total rate of energy generation of the Sun is $4 \times 10^{26}\ \mathrm{J\,s^{-1}}$.
(b) From this result and the Stefan-Boltzmann constant $\sigma_B = 5.67 \times
10^{-12}\ \mathrm{J\,s^{-1}\,cm^{-2}\,K^{-4}}$, show that the effective temperature of the surface of the
Sun treated as a black body is $T \cong 6000$ K. Take the distance of the Earth from
the Sun as $1.5 \times 10^{13}$ cm and the radius of the Sun as $7 \times 10^{10}$ cm.
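
As a rough numerical check of the two figures quoted in Problem 2, the sketch below scales the solar constant up to a sphere at the Earth-Sun distance and then back down to the solar surface. The variable names are illustrative; the input values are the ones given in the problem.

```python
import math

solar_constant = 0.136        # J s^-1 cm^-2, Eq. (49)
earth_sun_distance = 1.5e13   # cm
sun_radius = 7.0e10           # cm
sigma_B = 5.67e-12            # J s^-1 cm^-2 K^-4

# (a) Total power crossing a sphere of radius equal to the Earth-Sun distance
luminosity = solar_constant * 4.0 * math.pi * earth_sun_distance**2

# (b) Flux density at the solar surface, then J_U = sigma_B * T**4
surface_flux = luminosity / (4.0 * math.pi * sun_radius**2)
surface_temperature = (surface_flux / sigma_B) ** 0.25

print(f"luminosity  ~ {luminosity:.2e} J/s")
print(f"temperature ~ {surface_temperature:.0f} K")
```

With these rounded inputs the temperature comes out somewhat below 6000 K, which is consistent with the approximate value stated in the problem.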


3. Average temperature of the interior of the Sun. (a) Estimate by a dimen-
sional argument or otherwise the order of magnitude of the gravitational self-
energy of the Sun, with $M_\odot = 2 \times 10^{33}$ g and $R_\odot = 7 \times 10^{10}$ cm. The gravi-
tational constant $G$ is $6.6 \times 10^{-8}$ dyne cm$^2$ g$^{-2}$. The self-energy will be negative
referred to atoms at rest at infinite separation. (b) Assume that the total thermal
kinetic energy of the atoms in the Sun is equal to $-\tfrac{1}{2}$ times the gravitational
energy. This is the result of the virial theorem of mechanics. Estimate the average
temperature of the Sun. Take the number of particles as $1 \times 10^{57}$. This estimate
gives somewhat too low a temperature, because the density of the Sun is far
from uniform. “The range in central temperature for different stars, excluding
only those composed of degenerate matter for which the law of perfect gases
does not hold (white dwarfs) and those which have excessively small average
densities (giants and supergiants), is between 1.5 and $3.0 \times 10^7$ degrees.”
(O. Struve, B. Lynds, and H. Pillans, Elementary astronomy, Oxford, 1959.)


4. Age of the Sun. Suppose $4 \times 10^{26}\ \mathrm{J\,s^{-1}}$ is the total rate at which the Sun
radiates energy at the present time. (a) Find the total energy of the Sun available
for radiation, on the rough assumptions that the energy source is the conversion
of hydrogen (atomic weight 1.0078) to helium (atomic weight 4.0026) and that
the reaction stops when 10 percent of the original hydrogen has been converted
to helium. Use the Einstein relation $E = (\Delta M)c^2$. (b) Use (a) to estimate the
life expectancy of the Sun. It is believed that the age of the universe is about
$10 \times 10^9$ years. (A good discussion is given in the books by Peebles and by
Weinberg, cited in the general references.)


5. Surface temperature of the Earth. Calculate the temperature of the surface
of the Earth on the assumption that as a black body in thermal equilibrium it
reradiates as much thermal radiation as it receives from the Sun. Assume also
that the surface of the Earth is at a constant temperature over the day-night
cycle. Use $T_\odot = 5800$ K; $R_\odot = 7 \times 10^{10}$ cm; and the Earth-Sun distance of
$1.5 \times 10^{13}$ cm.


6. Pressure of thermal radiation. Show for a photon gas that: 


$$\text{(a)}\quad p = -(\partial U/\partial V)_\sigma = -\sum_j s_j \hbar\,(d\omega_j/dV) , \tag{50}$$

where $s_j$ is the number of photons in the mode $j$;

$$\text{(b)}\quad d\omega_j/dV = -\omega_j/3V ; \tag{51}$$

$$\text{(c)}\quad p = U/3V . \tag{52}$$


Thus the radiation pressure is equal to $\tfrac{1}{3} \times$ (energy density).

(d) Compare the pressure of thermal radiation with the kinetic pressure of a
gas of H atoms at a concentration of 1 mole cm$^{-3}$ characteristic of the Sun.
At what temperature (roughly) are the two pressures equal? The average
temperature of the Sun is believed to be near $2 \times 10^7$ K. The concentration is
highly nonuniform and rises to near 100 mole cm$^{-3}$ at the center, where the
kinetic pressure is considerably higher than the radiation pressure.


7. Free energy of a photon gas. (a) Show that the partition function of a 
photon gas is given by 


$$Z = \prod_n \left[1 - \exp(-\hbar\omega_n/\tau)\right]^{-1} , \tag{53}$$


where the product is over the modes $n$. (b) The Helmholtz free energy is found
directly from (53) as


$$F = \tau \sum_n \log\left[1 - \exp(-\hbar\omega_n/\tau)\right] . \tag{54}$$


Transform the sum to an integral; integrate by parts to find

$$F = -\pi^2 V \tau^4/45\hbar^3 c^3 . \tag{55}$$


8. Heat shields. A black (nonreflective) plane at temperature $T_u$ is parallel
to a black plane at temperature $T_l$. The net energy flux density in vacuum be-
tween the two planes is $J_U = \sigma_B(T_u^4 - T_l^4)$, where $\sigma_B$ is the Stefan-Boltzmann
constant used in (26). A third black plane is inserted between the other two and
is allowed to come to a steady state temperature $T_m$. Find $T_m$ in terms of $T_u$
and $T_l$, and show that the net energy flux density is cut in half because of the
presence of this plane. This is the principle of the heat shield and is widely
used to reduce radiant heat transfer. Comment: The result for $N$ independent
heat shields floating in temperature between the planes $T_u$ and $T_l$ is that the
net energy flux density is $J_U = \sigma_B(T_u^4 - T_l^4)/(N + 1)$.


9. Photon gas in one dimension. Consider a transmission line of length $L$ on
which electromagnetic waves satisfy the one-dimensional wave equation
$v^2\,\partial^2 E/\partial x^2 = \partial^2 E/\partial t^2$, where $E$ is an electric field component. Find the heat
capacity of the photons on the line, when in thermal equilibrium at temperature
$\tau$. The enumeration of modes proceeds in the usual way for one dimension:
take the solutions as standing waves with zero amplitude at each end of the
line.


10. Heat capacity of intergalactic space. Intergalactic space is believed to be
occupied by hydrogen atoms in a concentration $\approx 1$ atom m$^{-3}$. The space is
also occupied by thermal radiation at 2.9 K, from the Primitive Fireball. Show
that the ratio of the heat capacity of matter to that of radiation is $\sim 10^{-9}$.


11. Heat capacity of solids in high temperature limit. Show that in the limit
$T \gg \theta$ the heat capacity of a solid goes towards the limit $C_V \to 3Nk_B$, in
conventional units. To obtain higher accuracy when $T$ is only moderately
larger than $\theta$, the heat capacity can be expanded as a power series in $1/T$, of
the form


$$C_V = 3Nk_B \times \left[1 - \sum_n a_n/T^n\right] . \tag{56}$$


Determine the first nonvanishing term in the sum. Check your result by inserting
$T = \theta$ and comparing with Table 4.2.


12. Heat capacity of photons and phonons. Consider a dielectric solid with a 
Debye temperature equal to 100 K and with $10^{22}$ atoms cm$^{-3}$. Estimate the
temperature at which the photon contribution to the heat capacity would be 
equal to the phonon contribution evaluated at 1 K. 


13. Energy fluctuations in a solid at low temperatures. Consider a solid of $N$
atoms in the temperature region in which the Debye $T^3$ law is valid. The solid
is in thermal contact with a heat reservoir. Use the results on energy fluctuations
from Chapter 3 to show that the root mean square fractional energy fluctuation
$\mathcal{F}$ is given by


$$\mathcal{F}^2 = \langle(\varepsilon - \langle\varepsilon\rangle)^2\rangle/\langle\varepsilon\rangle^2 \simeq \frac{0.07}{N}\left(\frac{\theta}{T}\right)^3 . \tag{57}$$


Suppose that $T = 10^{-2}$ K; $\theta = 200$ K; and $N \approx 10^{15}$ for a particle 0.01 cm on
a side; then $\mathcal{F} \approx 0.02$. At $10^{-5}$ K the fractional fluctuation in energy is of the
order of unity for a dielectric particle of volume 1 cm$^3$.
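
The two numerical estimates at the end of Problem 13 follow from (57) by direct substitution. In the sketch below the coefficient is written as $20/3\pi^4 \approx 0.07$, which is what one obtains by combining the Debye $T^3$ energy with the Chapter 3 fluctuation result; that closed form, the function name, and the value $N \approx 10^{21}$ for a 1 cm$^3$ particle (scaled from $10^{15}$ atoms in $10^{-6}$ cm$^3$) are assumptions made for illustration.

```python
import math


def fractional_fluctuation(n_atoms: float, debye_temp: float, temp: float) -> float:
    """Root mean square fractional energy fluctuation from Eq. (57)."""
    coeff = 20.0 / (3.0 * math.pi**4)   # about 0.07 (assumed closed form)
    return math.sqrt(coeff / n_atoms * (debye_temp / temp) ** 3)


# Particle 0.01 cm on a side at 10^-2 K
small_particle = fractional_fluctuation(1e15, 200.0, 1e-2)

# Particle of volume 1 cm^3 at 10^-5 K (N ~ 1e21 is an assumed scaling)
large_particle = fractional_fluctuation(1e21, 200.0, 1e-5)

print(f"0.01 cm particle at 1e-2 K: F ~ {small_particle:.3f}")
print(f"1 cm^3 particle at 1e-5 K:  F ~ {large_particle:.2f}")
```

The first value is close to 0.02 and the second is of order unity, as the problem states.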


14. Heat capacity of liquid $^4$He at low temperatures. The velocity of longitu-
dinal sound waves in liquid $^4$He at temperatures below 0.6 K is $2.383 \times 10^4$ cm
s$^{-1}$. There are no transverse sound waves in the liquid. The density is
0.145 g cm$^{-3}$. (a) Calculate the Debye temperature. (b) Calculate the heat
capacity per gram on the Debye theory and compare with the experimental
value $C_V = 0.0204 \times T^3$, in J g$^{-1}$ K$^{-1}$. The $T^3$ dependence of the experimental
value suggests that phonons are the most important excitations in liquid $^4$He
below 0.6 K. Note that the experimental value has been expressed per gram of
liquid. The experiments are due to J. Wiebes, C. G. Niels-Hakkenberg, and
H. C. Kramers, Physica 32, 625 (1957).


15. Angular distribution of radiant energy flux. (a) Show that the spectral
density of the radiant energy flux that arrives in the solid angle $d\Omega$ is
$c u_\omega \cos\theta \cdot d\Omega/4\pi$, where $\theta$ is the angle the normal to the unit area makes with
the incident ray, and $u_\omega$ is the energy density per unit frequency range. (b) Show
that the sum of this quantity over all incident rays is $\tfrac{1}{4} c u_\omega$.


16. Image of a radiant object. Let a lens image the hole in a cavity of area
$A_H$ on a black object of area $A_O$. Use an equilibrium argument to relate the
product $A_H\Omega_H$ to $A_O\Omega_O$, where $\Omega_H$ and $\Omega_O$ are the solid angles subtended by the
lens as viewed from the hole and from the object. This general property of
focusing systems is easily derived from geometrical optics. It is also true when
diffraction is important.


17. Entropy and occupancy. We argued in this chapter that the entropy of the 
cosmic black body radiation has not changed with time because the number 
of photons in each mode has not changed with time, although the frequency of 
each mode has decreased as the wavelength has increased with the expansion 
of the universe. Establish the implied connection between entropy and
occupancy of the modes, by showing that for one mode of frequency $\omega$ the entropy
is a function of the photon occupancy $\langle s \rangle$ only:


$$\sigma = \langle s + 1 \rangle \log\langle s + 1 \rangle - \langle s \rangle \log\langle s \rangle . \tag{58}$$


It is convenient to start from the partition function. 


18. Isentropic expansion of photon gas. Make the approximation that all rays 
are nearly parallel (all axial angles small). Consider the gas of photons of the 
thermal equilibrium radiation in a cube of volume $V$ at temperature $\tau$. Let
the cavity volume increase; the radiation pressure performs work during the
expansion, and the temperature of the radiation will drop. From the result
for the entropy we know that $\tau V^{1/3}$ is constant in such an expansion. (a) Assume
that the temperature of the cosmic black-body radiation was decoupled from 
the temperature of the matter when both were at 3000 K. What was the radius 
of the universe at that time, compared to now? If the radius has increased 
linearly with time, at what fraction of the present age of the universe did the 
decoupling take place? (b) Show that the work done by the photons during the 
expansion is 


$$W = (\pi^2/15\hbar^3 c^3)\,V_i \tau_i^3 (\tau_i - \tau_f) .$$


The subscripts i and f refer to the initial and final states. 


19. Reflective heat shield and Kirchhoff’s law. Consider a plane sheet of mate- 
rial of absorptivity a, emissivity e, and reflectivity r = 1 — a. Let the sheet be 
suspended between and parallel with two black sheets maintained at tempera-
tures $\tau_u$ and $\tau_l$. Show that the net flux density of thermal radiation between the
black sheets is (1 — r) times the flux density when the intermediate sheet is 
also black as in Problem 8, which means with a = e = 1;r = 0. Liquid helium 
dewars are often insulated by many, perhaps 100, layers of an aluminized 
Mylar film called Superinsulation.


We considered in Chapter 2 the properties of two systems in thermal contact,
and we were led naturally to the definition of the temperature. If the two systems 
have the same temperature, there is no net energy flow between them. If the 
temperatures of two systems are different, energy will flow from the system 
with the higher temperature to the system with the lower temperature. 

Now consider systems that can exchange particles as well as energy. Such 
systems are said to be in diffusive (and thermal) contact: molecules can move 
from one system to the other by diffusion through a permeable interface. Two 
systems are in equilibrium with respect to particle exchange when the net 
particle flow is zero. 

The chemical potential governs the flow of particles between the systems, just 
as the temperature governs the flow of energy. If two systems with a single 
chemical species are at the same temperature and have the same value of the 
chemical potential, there will be no net particle flow and no net energy flow 
between them. If the chemical potentials of the two systems are different, 
particles will flow from the system at the higher chemical potential to the 
system at the lower chemical potential. As an example, the chemical potential 
of electrons at one terminal of a storage battery is higher than at the other 
terminal. When the terminals are connected by a wire, electrons will flow in the 
wire from high to low chemical potential. 

Consider the establishment of diffusive equilibrium between two systems
$\mathcal{S}_1$ and $\mathcal{S}_2$ that are in thermal and diffusive contact. We maintain $\tau$ constant
by placing both systems in thermal contact (Figure 5.1) with a large reservoir
$\mathcal{R}$. We found earlier that for a single system $\mathcal{S}$ in thermal equilibrium with a
reservoir $\mathcal{R}$, the Helmholtz free energy of $\mathcal{S}$ will assume the minimum value
compatible with the common temperature $\tau$ and with other restraints on the
system, such as the volume and the number of particles. This result applies
equally to the combined system $\mathcal{S}_1 + \mathcal{S}_2$ in equilibrium with $\mathcal{R}$. In diffusive
equilibrium between $\mathcal{S}_1$ and $\mathcal{S}_2$, the particle distribution $N_1$, $N_2$ between the
systems makes the total Helmholtz free energy


$$F = F_1 + F_2 = U_1 + U_2 - \tau(\sigma_1 + \sigma_2) \tag{1}$$


a minimum, subject to $N = N_1 + N_2 = \text{constant}$. Because $N$ is constant, the
Helmholtz free energy of the combined system is a minimum with respect to
variations $\delta N_1 = -\delta N_2$. At the minimum,

$$dF = (\partial F_1/\partial N_1)_\tau\, dN_1 + (\partial F_2/\partial N_2)_\tau\, dN_2 = 0 , \tag{2}$$

with $V_1$, $V_2$ also held constant. With $dN_1 = -dN_2$, we have

$$dF = \left[(\partial F_1/\partial N_1)_\tau - (\partial F_2/\partial N_2)_\tau\right] dN_1 = 0 , \tag{3}$$

so that at equilibrium

$$(\partial F_1/\partial N_1)_\tau = (\partial F_2/\partial N_2)_\tau . \tag{4}$$


Figure 5.1 Example of two systems, $\mathcal{S}_1$ and $\mathcal{S}_2$, in thermal contact with each other
and with a large reservoir $\mathcal{R}$, forming a closed total system. By opening the valve,
$\mathcal{S}_1$ and $\mathcal{S}_2$ can be brought in diffusive contact while remaining at the common
temperature $\tau$. The arrows at the valve have been drawn for a net particle transfer
from $\mathcal{S}_1$ to $\mathcal{S}_2$.

DEFINITION OF CHEMICAL POTENTIAL 


We define the chemical potential as


$$\mu(\tau, V, N) \equiv (\partial F/\partial N)_{\tau, V} , \tag{5}$$

where $\mu$ is the Greek letter mu. Then


$$\mu_1 = \mu_2$$


expresses the condition for diffusive equilibrium. If $\mu_1 > \mu_2$, we see from (3)
that $dF$ will be negative when $dN_1$ is negative: When particles are transferred
from $\mathcal{S}_1$ to $\mathcal{S}_2$, the value of $dN_1$ is negative, and $dN_2$ is positive. Thus the
free energy decreases as particles flow from $\mathcal{S}_1$ to $\mathcal{S}_2$; that is, particles flow
from the system of high chemical potential to the system of low chemical
potential. The strict definition of $\mu$ is in terms of a difference and not a deriva-
tive, because particles are not divisible:


$$\mu(\tau, V, N) \equiv F(\tau, V, N) - F(\tau, V, N - 1) . \tag{6}$$


The chemical potential regulates the particle transfer between systems in 
contact, and it is fully as important as the temperature, which regulates the 
energy transfer. Two systems that can exchange both energy and particles are 
in combined thermal and diffusive equilibrium when their temperatures and 
chemical potentials are equal: $\tau_1 = \tau_2$; $\mu_1 = \mu_2$.

A difference in chemical potential acts as a driving force for the transfer of 
particles just as a difference in temperature acts as a driving force for the transfer 
of energy. 

If several chemical species are present, each has its own chemical potential. 
For species j, 


$$\mu_j = (\partial F/\partial N_j)_{\tau, V, N_1, N_2, \ldots} , \tag{7}$$


where in the differentiation the numbers of all particles are held constant except 
for the species j. 


Example: Chemical potential of the ideal gas. In (3.70) we showed that the free energy
of the monatomic ideal gas is


$$F = -\tau\left[N\log Z_1 - \log N!\right] , \tag{8}$$


where

$$Z_1 = n_Q V = (M\tau/2\pi\hbar^2)^{3/2}\, V \tag{9}$$


is the partition function for a single particle. From (8),

$$\mu = (\partial F/\partial N)_{\tau, V} = -\tau\left[\log Z_1 - \frac{d}{dN}\log N!\right] . \tag{10}$$


If we use the Stirling approximation for $N!$ and assume that we can differentiate the
factorial, we find

$$\frac{d}{dN}\log N! = \frac{d}{dN}\left[\log\sqrt{2\pi} + (N + \tfrac{1}{2})\log N - N\right]
= \log N + (N + \tfrac{1}{2})\cdot\frac{1}{N} - 1 = \log N + \frac{1}{2N} , \tag{11}$$

which approaches $\log N$ for large values of $N$. Hence the chemical potential of the ideal
gas is


$$\mu = -\tau(\log Z_1 - \log N) = \tau\log(N/Z_1) ,$$


or, by (9),


$$\mu = \tau\log(n/n_Q) , \tag{12a}$$


where $n = N/V$ is the concentration of particles and $n_Q = (M\tau/2\pi\hbar^2)^{3/2}$ is the quantum
concentration defined by (3.63).

If we use $\mu = F(N) - F(N - 1)$ from (6) as the definition of $\mu$, we do not need to use the
Stirling approximation. From (8) we obtain $\mu = -\tau[\log Z_1 - \log N]$, which agrees with
(12). The result depends on the concentration of particles, not on their total number or on
the system volume separately. By use of the ideal gas law $p = n\tau$ we can write (12) as


$$\mu = \tau\log(p/\tau n_Q) . \tag{12b}$$
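
The statement that the difference definition (6) reproduces (12) without the Stirling approximation can be checked numerically. The sketch below is illustrative only: it works in units where $\tau = 1$, takes an arbitrary value for $Z_1$, and uses `math.lgamma(N + 1)` for $\log N!$; the function names are not from the text.

```python
import math


def free_energy(n: int, z1: float, tau: float = 1.0) -> float:
    """F = -tau [N log Z1 - log N!], Eq. (8); lgamma(n + 1) equals log N!."""
    return -tau * (n * math.log(z1) - math.lgamma(n + 1))


def mu_difference(n: int, z1: float, tau: float = 1.0) -> float:
    """Strict definition, Eq. (6): mu = F(N) - F(N - 1)."""
    return free_energy(n, z1, tau) - free_energy(n - 1, z1, tau)


def mu_stirling(n: int, z1: float, tau: float = 1.0) -> float:
    """Derivative definition with the Stirling result (11) kept in full."""
    return -tau * (math.log(z1) - math.log(n) - 1.0 / (2.0 * n))


def mu_closed_form(n: int, z1: float, tau: float = 1.0) -> float:
    """mu = tau log(N / Z1), the form used in (12)."""
    return tau * math.log(n / z1)


z1 = 1.0e6  # arbitrary single-particle partition function (assumption)
for n in (10, 1000):
    print(n, mu_difference(n, z1), mu_stirling(n, z1), mu_closed_form(n, z1))
```

The difference definition agrees with $\tau\log(N/Z_1)$ to rounding error for every $N$, while the Stirling derivative differs from it by $\tau/2N$, which is negligible for large $N$.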


The chemical potential increases as the concentration of particles increases. This is what
we expect intuitively: particles flow from higher to lower chemical potential, from higher to
lower concentration. Figure 5.2 shows the dependence on concentration of the chemical
potential of an ideal gas composed of electrons or of helium atoms, for two temperatures,
the boiling temperature of liquid helium at atmospheric pressure, 4.2 K, and room
temperature, 300 K. Atomic and molecular gases always have negative chemical potentials
under physically realizable conditions: at classical concentrations such that $n/n_Q \ll 1$, we
see from (12) that $\mu$ is negative.

Figure 5.2 The concentration dependence of $\mu$, in units of $\tau$, of an ideal gas
composed of electrons or helium atoms, at 4.2 K and 300 K. To be in the classical
regime with $n \ll n_Q$, a gas must have a value of $-\mu$ at least $\tau$. For electrons this is
satisfied only for concentrations appreciably less than those in metals, as in the
range of typical semiconductors. For gases it is always satisfied under normal
conditions.
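
To put numbers on the classical condition $n \ll n_Q$, the sketch below evaluates the quantum concentration (9) and the ratio $\mu/\tau$ from (12) in SI units. The physical constants are standard values; the choice of helium at 300 K and one atmosphere, and the function names, are illustrative assumptions rather than cases worked in the text.

```python
import math

HBAR = 1.054571817e-34               # J s
K_B = 1.380649e-23                   # J / K
M_HELIUM = 4.002602 * 1.66053907e-27 # kg
M_ELECTRON = 9.1093837e-31           # kg


def quantum_concentration(mass_kg: float, temp_k: float) -> float:
    """n_Q = (M tau / 2 pi hbar^2)^(3/2), in m^-3."""
    tau = K_B * temp_k
    return (mass_kg * tau / (2.0 * math.pi * HBAR**2)) ** 1.5


def mu_over_tau(concentration: float, mass_kg: float, temp_k: float) -> float:
    """mu / tau = log(n / n_Q) for an ideal gas, Eq. (12)."""
    return math.log(concentration / quantum_concentration(mass_kg, temp_k))


# Helium gas at 300 K and 1 atm: n = p / (k_B T)
n_helium = 101325.0 / (K_B * 300.0)
mu_he = mu_over_tau(n_helium, M_HELIUM, 300.0)

# Quantum concentration of electrons at 300 K (spin not included)
n_q_electron = quantum_concentration(M_ELECTRON, 300.0)

print(f"He, 300 K, 1 atm: mu/tau ~ {mu_he:.1f}")
print(f"electrons, 300 K: n_Q ~ {n_q_electron * 1e-6:.2e} cm^-3")
```

For helium under these conditions $\mu/\tau$ is about $-13$, deep in the classical regime. The electron value of $n_Q$, about $10^{19}$ cm$^{-3}$, lies well below the electron concentrations of metals, in line with the remark in the caption of Figure 5.2.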


Internal and Total Chemical Potential 


The best way to understand the chemical potential is to discuss diffusive
equilibrium in the presence of a potential step that acts on the particles. This
problem has wide application and includes the semiconductor p-n junction
discussed in Chapter 13. We again consider two systems, $\mathcal{S}_1$ and $\mathcal{S}_2$, at the
same temperature and capable of exchanging particles, but not yet in diffusive
equilibrium. We assume that initially $\mu_2 > \mu_1$, and we denote the initial non-
equilibrium chemical potential difference by $\Delta\mu(\text{initial}) = \mu_2 - \mu_1$. Now let a
difference in potential energy be established between the two systems, such
that the potential energy of each particle in system $\mathcal{S}_1$ is raised by exactly
$\Delta\mu(\text{initial})$ above its initial value. If the particles carry a charge $q$, one simple
way to establish this potential step is to apply between the two systems a voltage
$\Delta V$ such that


$$q\,\Delta V = q(V_2 - V_1) = \Delta\mu(\text{initial}) , \tag{13}$$


with the polarity shown in Figure 5.3. A difference in gravitational potential
also can serve as a potential difference: when we raise a system of particles each
of mass $M$ by the height $h$, we establish a potential difference $Mgh$, where $g$ is
the gravitational acceleration.

Figure 5.3 A potential step between two systems of charged particles can be established
by applying a voltage between the systems. For the voltage polarity shown, the potential
energy of positive particles with charge $q > 0$ in system $\mathcal{S}_1$ would be raised by $q\,\Delta V$ with
respect to $\mathcal{S}_2$. The potential energy of negative particles would be lowered in $\mathcal{S}_1$ with
respect to $\mathcal{S}_2$.

Once a potential step is present, the potential energy of the particles produced
by this step is included in the energy $U$ and in the free energy $F$ of the system.
If in Figure 5.3 we keep the free energy of system $\mathcal{S}_2$ fixed, the step raises the
free energy of $\mathcal{S}_1$ by $N_1\,\Delta\mu(\text{initial}) = N_1 q\,\Delta V$ relative to its initial value. In
the language of energy states, to the energy of each state of $\mathcal{S}_1$ the potential
energy $N_1\,\Delta\mu(\text{initial})$ has been added. The insertion of the potential barrier
specified by (13) raises the chemical potential of $\mathcal{S}_1$ by $\Delta\mu(\text{initial})$, to make the
final chemical potential of $\mathcal{S}_1$ equal to that of $\mathcal{S}_2$:


$$\mu_1(\text{final}) = \mu_1(\text{initial}) + [\mu_2(\text{initial}) - \mu_1(\text{initial})]
= \mu_2(\text{initial}) = \mu_2(\text{final}) . \tag{14}$$


When the barrier was inserted, $\mu_2$ was held fixed. Thus the barrier $q\,\Delta V =
\mu_2(\text{initial}) - \mu_1(\text{initial})$ brings the two systems into diffusive equilibrium.


The chemical potential is equivalent to a true potential energy: the
difference in chemical potential between two systems is equal to the
potential barrier that will bring the two systems into diffusive equilibrium.



This statement gives us a feeling for the physical effect of the chemical poten- 
tial, and it forms the basis for the measurement of chemical potential differences 
between two systems. To measure $\mu_2 - \mu_1$, we establish a potential step
between two systems that can transfer particles, and we determine the step 
height at which the net particle transfer vanishes. 
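
As a concrete illustration of measuring a chemical potential difference by a potential step, the sketch below assumes that both systems are ideal gases of the same species at the same temperature, so that by (12) the difference is $\mu_2 - \mu_1 = \tau\log(n_2/n_1)$. The temperature of 300 K, the tenfold concentration ratio, and the singly charged particles are illustrative assumptions, not values from the text.

```python
import math

K_B_EV = 8.617333262e-5  # Boltzmann constant in eV / K


def step_height_ev(concentration_ratio: float, temp_k: float) -> float:
    """Potential step (in eV per particle) that stops net particle transfer.

    Assumes ideal gases, so mu_2 - mu_1 = tau * log(n_2 / n_1).
    """
    return K_B_EV * temp_k * math.log(concentration_ratio)


delta_mu_ev = step_height_ev(10.0, 300.0)

# For particles carrying one elementary charge, q * dV = delta_mu,
# so the voltage in volts is numerically equal to delta_mu in eV.
voltage = delta_mu_ev / 1.0

print(f"delta mu ~ {delta_mu_ev * 1e3:.1f} meV, voltage ~ {voltage * 1e3:.1f} mV")
```

Under these assumptions each factor of ten in concentration corresponds to a step of roughly 60 meV at room temperature.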

Only differences of chemical potential have a physical meaning. The absolute 
value of the chemical potential depends on the zero of the potential energy 
scale. The ideal gas result (12) depends on the choice of the zero of energy of a 
free particle as equal to the zero of the kinetic energy. 

When external potential steps are present, we can express the total chemical 
potential of a system as the sum of two parts: 


$$\mu = \mu_{\text{tot}} = \mu_{\text{ext}} + \mu_{\text{int}} . \tag{15}$$


Here $\mu_{\text{ext}}$ is the potential energy per particle in the external potential, and $\mu_{\text{int}}$
is the internal chemical potential* defined as the chemical potential that would
be present if the external potential were zero. The term $\mu_{\text{ext}}$ may be mechanical,
electrical, magnetic, gravitational, etc. in origin. The equilibrium condition
$\mu_2 = \mu_1$ can be expressed as


$$\Delta\mu_{\text{ext}} = -\Delta\mu_{\text{int}} . \tag{16}$$


Unfortunately, the distinction between external and internal chemical potential 
sometimes is not made in the literature. Some writers, particularly those working 
with charged particles in the fields of electrochemistry and of semiconductors, 
often mean the internal chemical potential when they use the words chemical 
potential without a further qualifier. 

The total chemical potential may be called the electrochemical potential if
the potential barriers of interest are electrostatic. Although the term electro-
chemical potential is clear and unambiguous, we shall use “total chemical
potential.” The use of “chemical potential” without an adjective should be
avoided in situations in which any confusion about its meaning could occur.


* Gibbs called $\mu$ the potential and $\mu_{\text{int}}$ the intrinsic potential. He recognized that a voltmeter mea-
sures differences in $\mu$.


Figure 5.4 A model of the variation of atmospheric pressure with altitude: two
volumes of gas, System (1) and System (2), at heights differing by $h$ in a uniform
gravitational field, in thermal and diffusive contact.


Example: Variation of barometric pressure with altitude. The simplest example of the 
diffusive equilibrium between systems in different external potentials is the equilibrium 
between layers at different heights of the Earth’s atmosphere, assumed to be isothermal. 
The real atmosphere is in imperfect equilibrium; it is constantly upset by meteorological 
processes, both in the form of macroscopic air movements and of strong temperature 
gradients from cloud formation, and because of heat input from the ground. We may 
make an approximate model of the atmosphere by treating the different air layers as 
systems of ideal gases in thermal and diffusive equilibrium with each other, in different 
external potentials (Figure 5.4). If we place the zero of the potential energy at ground level, 
the potential energy per molecule at height h is Mgh, where M is the particle mass and g the 
gravitational acceleration. The internal chemical potential of the particles is given by (12). 
The total chemical potential is 


$$\mu = \tau\log(n/n_Q) + Mgh . \tag{17}$$

In equilibrium, this must be independent of the height. Thus

$$\tau\log[n(h)/n_Q] + Mgh = \tau\log[n(0)/n_Q] ,$$

and the concentration $n(h)$ at height $h$ satisfies


$$n(h) = n(0)\exp(-Mgh/\tau) . \tag{18}$$

The pressure of an ideal gas is proportional to the concentration; therefore the pressure
at altitude $h$ is


$$p(h) = p(0)\exp(-Mgh/\tau) = p(0)\exp(-h/h_c) . \tag{19}$$


This is the barometric pressure equation. It gives the dependence of the pressure on altitude
in an isothermal atmosphere of a single chemical species. At the characteristic height $h_c =
\tau/Mg$ the atmospheric pressure has fallen to the fraction $e^{-1} \approx 0.37$ of its value at $h = 0$.
To estimate the characteristic height, consider an isothermal atmosphere composed of
nitrogen molecules with a molecular weight of 28. The mass of an N$_2$ molecule is
$48 \times 10^{-24}$ gm. At a temperature of 290 K the value of $\tau = k_B T$ is $4.0 \times 10^{-14}$ erg. With
$g = 980$ cm s$^{-2}$, the characteristic height $h_c$ is 8.5 km, approximately 5 miles. Lighter
molecules, H$_2$ and He, will extend farther up, but these have largely escaped from the
atmosphere: see Problem 2.
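
The estimate of the characteristic height can be reproduced from the values quoted in the text. The sketch below uses CGS units throughout; the variable and function names are illustrative.

```python
import math

# Values quoted in the text (CGS units)
tau = 4.0e-14            # erg, k_B * T at 290 K
molecule_mass = 48e-24   # g, mass of an N2 molecule as given in the text
g = 980.0                # cm s^-2

h_c_cm = tau / (molecule_mass * g)   # characteristic height h_c = tau / (M g)
h_c_km = h_c_cm / 1.0e5


def pressure_ratio(height_km: float) -> float:
    """p(h) / p(0) for the isothermal model, Eq. (19)."""
    return math.exp(-height_km / h_c_km)


print(f"h_c ~ {h_c_km:.1f} km")
print(f"p(h_c)/p(0) ~ {pressure_ratio(h_c_km):.2f}")
```

With these inputs the sketch gives $h_c \approx 8.5$ km and a pressure ratio of about 0.37 at that height.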

Because the Earth’s atmosphere is not accurately isothermal, $n(h)$ has a more complicated
behavior. Figure 5.5 is a logarithmic plot of pressure data between 10 and 40 kilometers,
taken on rocket flights. The data points fall near a straight line, suggesting roughly iso-
thermal behavior. The straight line connecting the data points of Figure 5.5 spans a pressure
range $p(h_1):p(h_2) = 1000:1$, over an altitude range from $h_1 = 2$ km to $h_2 = 48$ km. Now,
from (19),


$$\log\frac{p(h_1)}{p(h_2)} = \frac{Mg}{\tau}(h_2 - h_1) , \tag{20}$$


so that the slope of the line is $Mg/\tau$, which leads to $T = \tau/k_B = 227$ K. The non-intersection
of the observed curve with the point $h = 0$, $p(h)/p(0) = 1$, is caused by the higher tempera-
ture at lower altitudes.
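
The temperature inferred from the slope can be checked by inverting (19) for $\tau$. The sketch below takes the 1000:1 pressure range between 2 km and 48 km from the text and reuses the N$_2$ mass quoted earlier; the variable names are illustrative.

```python
import math

K_B_ERG = 1.380649e-16   # Boltzmann constant in erg / K
molecule_mass = 48e-24   # g, N2 mass as quoted in the text
g = 980.0                # cm s^-2

lower_height_cm = 2.0e5    # 2 km
upper_height_cm = 48.0e5   # 48 km
pressure_range = 1000.0    # p(lower) / p(upper)

# Slope of log p versus h equals M g / tau
slope = math.log(pressure_range) / (upper_height_cm - lower_height_cm)
tau = molecule_mass * g / slope
temperature = tau / K_B_ERG

print(f"T ~ {temperature:.0f} K")
```

The result is close to the 227 K stated in the text.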

The atmosphere consists of more than one species of gas. In atomic percent, the com-
position of dry air at sea level is 78 pct N$_2$, 21 pct O$_2$, and 0.9 pct Ar; other constituents
account for less than 0.1 pct each. The water vapor content of the atmosphere may be
appreciable: at $T = 300$ K (27°C), a relative humidity of 100 pct corresponds to 3.5 pct
H$_2$O. The carbon dioxide concentration varies about a nominal value of 0.03 pct. In an
ideal static isothermal atmosphere each gas would be in equilibrium with itself. The con-
centration of each would fall off with a separate Boltzmann factor of the form $\exp(-Mgh/\tau)$,
with $M$ the appropriate molecular mass. Because of the differences in mass, the different
constituents fall off at different rates.


Example: Chemical potential of mobile magnetic particles in a magnetic field. Consider
a system of $N$ identical particles each with a magnetic moment $m$. For simplicity suppose
each moment is directed either parallel $\uparrow$ or antiparallel $\downarrow$ to an applied magnetic field $B$.
Then the potential energy of a $\uparrow$ particle is $-mB$, and the potential energy of a $\downarrow$ particle is
$+mB$. We may treat the particles as belonging to the two distinct chemical species labelled $\uparrow$
and $\downarrow$, one with external chemical potential $\mu_{\text{ext}}(\uparrow) = -mB$ and the other with $\mu_{\text{ext}}(\downarrow) = mB$.
The particles $\uparrow$ and $\downarrow$ are as distinguishable as two different isotopes of an element or as two
different elements; we speak of $\uparrow$ and $\downarrow$ as distinct species in equilibrium with each other.
The internal chemical potentials of the particles viewed as ideal gases with concentrations
$n_\uparrow$ and $n_\downarrow$ are


$$\mu_{\text{int}}(\uparrow) = \tau\log(n_\uparrow/n_Q); \qquad \mu_{\text{int}}(\downarrow) = \tau\log(n_\downarrow/n_Q) , \tag{21}$$


where $n_Q = (M\tau/2\pi\hbar^2)^{3/2}$ is the same for both species.
The total chemical potentials are


$$\mu_{\text{tot}}(\uparrow) = \tau\log(n_\uparrow/n_Q) - mB ; \tag{22a}$$

$$\mu_{\text{tot}}(\downarrow) = \tau\log(n_\downarrow/n_Q) + mB . \tag{22b}$$
If the magnetic field $B$ varies in magnitude over the volume of the system, the concentration
$n_\uparrow$ must vary over the volume in order to maintain a constant total chemical potential $\mu_{\text{tot}}(\uparrow)$
over the volume (Figure 5.6). (The total chemical potential of a species is constant indepen-
dent of position, if there is free diffusion of particles within the volume.) Because the two
species in equilibrium have equal chemical potentials,

$$\mu_{\text{tot}}(\uparrow) = \text{constant} = \mu_{\text{tot}}(\downarrow) . \tag{23}$$

The desired solutions of (22) and (23) are easily seen by substitution to be:

$$n_\uparrow(B) = \tfrac{1}{2} n(0)\exp(mB/\tau); \qquad n_\downarrow(B) = \tfrac{1}{2} n(0)\exp(-mB/\tau) , \tag{24}$$


where $n(0)$ is the total concentration $n_\uparrow + n_\downarrow$ at a point where the field $B = 0$. The total
concentration at a point at magnetic field $B$ is


$$n(B) = n_\uparrow(B) + n_\downarrow(B) = \tfrac{1}{2} n(0)\left[\exp(mB/\tau) + \exp(-mB/\tau)\right] ;$$


$$n(B) = n(0)\cosh(mB/\tau) \simeq n(0)\left(1 + \frac{m^2 B^2}{2\tau^2} + \cdots\right) . \tag{25}$$


Figure 5.6 Dependence of the chemical potential of a gas of magnetic
particles on the concentration, at several values of the magnetic field
intensity. If $n = 2 \times 10^7$ cm$^{-3}$ for $B = 0$, then at a point where $B = 20$
kilogauss (2 tesla) the concentration will be $2 \times 10^9$ cm$^{-3}$.

The result shows the tendency of magnetic particles to concentrate in regions of high 
magnetic field intensity. The functional form of the result is not limited to atoms with two 
magnetic orientations, but is applicable to fine ferromagnetic particles in suspension in a 
colloidal solution. Such suspensions are used in the laboratory in the study of the magnetic 
flux structure of superconductors and the domain structure of ferromagnetic materials. In 
engineering, the suspensions are used to test for fine structural cracks in high strength steel, 
such as turbine blades and aircraft landing gear. When these are coated with a ferromagnetic
suspension and placed in a magnetic field, the particle concentration becomes enhanced at
the intense fields at the edges of the crack.
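
The enhancement described by (25) depends only on the dimensionless ratio $mB/\tau$. The sketch below compares the exact $\cosh$ form with its small-field expansion and finds the value of $mB/\tau$ needed for a hundredfold enhancement; the hundredfold figure and the function names are illustrative assumptions.

```python
import math


def concentration_ratio(x: float) -> float:
    """n(B) / n(0) = cosh(mB / tau), with x = mB / tau, Eq. (25)."""
    return math.cosh(x)


def small_field_ratio(x: float) -> float:
    """Leading terms of the expansion in Eq. (25): 1 + x**2 / 2."""
    return 1.0 + 0.5 * x * x


# Weak field: the expansion is accurate
weak_exact = concentration_ratio(0.1)
weak_approx = small_field_ratio(0.1)

# Value of mB / tau that gives a hundredfold enhancement (illustrative)
x_hundredfold = math.acosh(100.0)

print(f"x = 0.1: exact {weak_exact:.6f}, expansion {weak_approx:.6f}")
print(f"n(B)/n(0) = 100 requires mB/tau ~ {x_hundredfold:.2f}")
```

A ratio $mB/\tau$ of about 5 already gives an enhancement of order 100. That is attainable for fine ferromagnetic particles, whose moments are far larger than those of single atoms.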

In the preceding discussion we added to $\mu_{\text{ext}}$ the internal chemical potential of the
particles. If the particles were ideal gas atoms, $\mu_{\text{int}}$ would be given by (12). The logarithmic
form for $\mu_{\text{int}}$ is not restricted to ideal gases, but is a consequence of the conditions that the
particles do not interact and that their concentration is sufficiently low. Hence, (12) applies
to macroscopic particles as well as to atoms that satisfy these assumptions. The only
difference is the value of the quantum concentration $n_Q$. We can therefore write


$$\mu_{\text{int}} = \tau\log n + \text{constant} , \tag{26}$$


where the constant ($= -\tau\log n_Q$) does not depend on the concentration of the particles.


Example: Batteries. One of the most vivid examples of chemical potentials and potential
steps is the electrochemical battery. In the familiar lead-acid battery the negative electrode
consists of metallic lead, Pb, and the positive electrode is a layer of reddish-brown lead
oxide, PbO$_2$, on a Pb substrate. The electrodes are immersed in diluted sulfuric acid,
H$_2$SO$_4$, which is partially ionized into H$^+$ ions (protons) and SO$_4^{--}$ ions (Figure 5.7).
It is the ions that matter.

In the discharge process both the metallic Pb of the negative electrode and the PbO$_2$
of the positive electrode are converted to lead sulfate, PbSO$_4$, via the two reactions:*


Negative electrode:
$$\mathrm{Pb + SO_4^{--} \rightarrow PbSO_4 + 2e^-} ; \tag{27a}$$


Positive electrode:
$$\mathrm{PbO_2 + 2H^+ + H_2SO_4 + 2e^- \rightarrow PbSO_4 + 2H_2O} . \tag{27b}$$


Because of (27a) the negative electrode acts as a sink for SO$_4^{--}$ ions, keeping the internal
chemical potential $\mu(\mathrm{SO_4^{--}})$ of the sulfate ions at the surface of the negative electrode
lower than inside the electrolyte (see Figure 5.7b). Similarly, because of (27b) the positive
electrode acts as a sink for H$^+$ ions, keeping the internal chemical potential $\mu(\mathrm{H^+})$ of the
hydrogen ions lower at the surface of the positive electrode than inside the electrolyte. The
chemical potential gradients drive the ions towards the electrodes, and they drive the
electrical currents during the discharge process.

If the battery terminals are not connected, electrons are depleted from the positive 
electrode and accumulate in the negative electrode, thereby charging both. As a result, 
electrochemical potential steps develop at the electrode-electrolyte interfaces, steps of 
exactly the correct magnitude to equalize the chemical potential steps and to stop the 
diffusion of ions, which stops the chemical reactions from proceeding further. If an external 
current is permitted to flow, the reactions resume. Electron flow directly through the 
electrolyte is negligible, because of a negligible electron concentration in the electrolyte. 


* The reactions given are net reactions. The actual reaction steps are more complicated. 
Figure 5.7 (a) The lead-acid battery consists of a Pb and a PbO₂ electrode immersed in
partially ionized H₂SO₄. One SO₄⁻⁻ ion converts one Pb atom into PbSO₄ + 2e⁻;
two H⁺ ions plus one un-ionized H₂SO₄ molecule convert one PbO₂ molecule into
PbSO₄ + 2H₂O, consuming two electrons. (b) The electrochemical potentials for SO₄⁻⁻
and H⁺ before the development of internal potential barriers that stop the diffusion
and the chemical reaction. (c) The electrostatic potential φ(x) after the formation of the
barrier.
During the charging process the reactions opposite to (27a,b) take place, because now an
external voltage is applied that generates electrostatic potential steps at the surface of the 
electrode of such magnitude as to reverse the sign of the (total) chemical potential gradients, 
and hence the direction of ion flow. 

We denote by $\Delta V_-$ and $\Delta V_+$ the differences in electrostatic potential of the negative and
positive electrodes relative to the common electrolyte. Because the sulfate ions carry two
negative charges, diffusion will stop when


$$-2q\,\Delta V_- = \Delta\mu(\mathrm{SO_4^{--}}). \tag{28a}$$

Diffusion of the H⁺ ions will stop when

$$+q\,\Delta V_+ = \Delta\mu(\mathrm{H^+}). \tag{28b}$$


The two potentials $\Delta V_-$ and $\Delta V_+$ are called half-cell potentials or half-cell EMF's
(electromotive forces); their magnitudes are known:


$$\Delta V_- = -0.4\ \text{volt}; \qquad \Delta V_+ = +1.6\ \text{volt}.$$


The total electrostatic potential difference developed across one full cell of the battery,
as required to stop the diffusion reaction, is


$$\Delta V = \Delta V_+ - \Delta V_- = 2.0\ \text{volt}. \tag{29}$$


This is the open-circuit voltage or EMF of the battery. It drives the electrons from the negative
terminal to the positive terminal, when the two are connected.
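
As a numerical restatement of (28a), (28b), and (29), the following minimal Python sketch converts the quoted half-cell potentials into chemical potential steps and the cell EMF. The function and variable names are illustrative, and energies are expressed in eV on the assumption that q is the proton charge, so that q times 1 volt equals 1 eV.

```python
# Half-cell potentials quoted in the text, in volts.
DV_MINUS = -0.4   # negative electrode relative to the electrolyte
DV_PLUS = +1.6    # positive electrode relative to the electrolyte


def chemical_potential_steps(dv_minus, dv_plus):
    """Return (dmu_sulfate, dmu_hydrogen) in eV from (28a) and (28b).

    Assumption: energies are in eV, so the charge q drops out of the arithmetic.
    """
    dmu_sulfate = -2.0 * dv_minus   # (28a): -2 q dV_- = dmu(SO4--)
    dmu_hydrogen = +1.0 * dv_plus   # (28b): +q dV_+ = dmu(H+)
    return dmu_sulfate, dmu_hydrogen


def open_circuit_voltage(dv_minus, dv_plus):
    """Equation (29): EMF of one cell, in volts."""
    return dv_plus - dv_minus


dmu_sulfate, dmu_hydrogen = chemical_potential_steps(DV_MINUS, DV_PLUS)
emf = open_circuit_voltage(DV_MINUS, DV_PLUS)
print(f"dmu(SO4--) = {dmu_sulfate:.1f} eV, dmu(H+) = {dmu_hydrogen:.1f} eV, EMF = {emf:.1f} V")
```

The chemical potential steps that the electrostatic steps must balance are of order 1 eV per ion, which is why a single cell delivers about 2 volts.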

We have ignored free electrons in the electrolyte. The potential steps tend to drive 
electrons from the negative electrodes into the electrolyte, and from the electrolyte into the 
positive electrode. Such an electron current is present, but the magnitude is so small as to be 
practically negligible, because the concentration of electrons in the electrolyte is many 
orders of magnitude less than that of the ions. The only effective electron flow path is 
through the external connection between the electrodes. 
In (5) we defined the chemical potential as a derivative of the Helmholtz free
energy. Here we derive an alternate relation, needed later:


$$\frac{\mu(U,V,N)}{\tau} = -\left(\frac{\partial\sigma}{\partial N}\right)_{U,V}. \tag{30}$$


This expresses the ratio μ/τ as a derivative of the entropy, similar to the way
1/τ was defined in Chapter 2.
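To derive (30), consider the entropy as a function of the independent variables
U, V, and N. The differential


$$d\sigma = \left(\frac{\partial\sigma}{\partial U}\right)_{V,N} dU + \left(\frac{\partial\sigma}{\partial V}\right)_{U,N} dV + \left(\frac{\partial\sigma}{\partial N}\right)_{U,V} dN \tag{31}$$


gives the differential change of the entropy for arbitrary, independent differential
changes dU, dV, and dN. Let dV = 0 for the processes under consideration.
Further, select the ratios of dσ, dU, and dN in such a way that the overall
temperature change dτ will be zero. If we denote these interdependent values
of dσ, dU, and dN by $(\delta\sigma)_\tau$, $(\delta U)_\tau$, and $(\delta N)_\tau$, then dτ = 0 when


$$(\delta\sigma)_\tau = \left(\frac{\partial\sigma}{\partial U}\right)_N (\delta U)_\tau + \left(\frac{\partial\sigma}{\partial N}\right)_U (\delta N)_\tau .$$

After division by $(\delta N)_\tau$,

$$\frac{(\delta\sigma)_\tau}{(\delta N)_\tau} = \left(\frac{\partial\sigma}{\partial U}\right)_N \frac{(\delta U)_\tau}{(\delta N)_\tau} + \left(\frac{\partial\sigma}{\partial N}\right)_U . \tag{32}$$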


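The ratio $(\delta\sigma)_\tau/(\delta N)_\tau$ is $(\partial\sigma/\partial N)_\tau$, and $(\delta U)_\tau/(\delta N)_\tau$ is $(\partial U/\partial N)_\tau$, all at constant
volume. With the original definition of 1/τ, we have


$$\left(\frac{\partial\sigma}{\partial N}\right)_\tau = \frac{1}{\tau}\left(\frac{\partial U}{\partial N}\right)_\tau + \left(\frac{\partial\sigma}{\partial N}\right)_U . \tag{33}$$


This expresses a derivative at constant U in terms of derivatives at constant τ.
By the original definition of the chemical potential,


$$\mu \equiv \left(\frac{\partial F}{\partial N}\right)_{\tau,V} \equiv \left(\frac{\partial U}{\partial N}\right)_{\tau,V} - \tau\left(\frac{\partial\sigma}{\partial N}\right)_{\tau,V}, \tag{34}$$


and on comparison with (33) we obtain


$$\mu = -\tau\left(\frac{\partial\sigma}{\partial N}\right)_{U,V}. \tag{35}$$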


The two expressions (5) and (35) represent two different ways to express the
same quantity μ. The difference between them is the following. In (5), F is a
function of its natural independent variables τ, V, and N, so that μ appears as
a function of the same variables. In (31) we assumed σ = σ(U,V,N), so that
(35) yields μ as a function of U, V, N. The quantity μ is the same in both (5)
and (35), but expressed in terms of different variables. The object of Problem 11
is to find a third relation for μ:


$$\mu(\sigma,V,N) = \left(\frac{\partial U}{\partial N}\right)_{\sigma,V}, \tag{36}$$


and in Chapter 10 we derive a relation for μ as a function of τ, p, and N. Table 5.1
compiles expressions for τ, p, and μ as derivatives of σ, U, and F. All forms
have their uses.


Table 5.1 Summary of relations expressing the temperature
τ, the pressure p, and the chemical potential μ in terms of
partial derivatives of the entropy σ, the energy U, and the free
energy F, with σ, U, and F given as functions of their natural
independent variables

      σ(U,V,N)                  U(σ,V,N)                 F(τ,V,N)
τ     1/τ = (∂σ/∂U)_{V,N}       τ = (∂U/∂σ)_{V,N}        τ is independent variable
p     p/τ = (∂σ/∂V)_{U,N}       −p = (∂U/∂V)_{σ,N}       −p = (∂F/∂V)_{τ,N}
μ     −μ/τ = (∂σ/∂N)_{U,V}      μ = (∂U/∂N)_{σ,V}        μ = (∂F/∂N)_{τ,V}
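
The equivalence of the three expressions (5), (35), and (36) can be checked numerically for a concrete case. The sketch below is an illustration, not part of the derivation: it assumes the standard entropy of an ideal monatomic gas (the Sackur-Tetrode form, with U = (3/2)Nτ), works in units where M/2πħ² = 1, treats N as a continuous variable, and estimates each derivative by a central difference. All function names are ours.

```python
import math


def n_quantum(tau):
    """Quantum concentration in units where M / (2 pi hbar^2) = 1 (an assumption)."""
    return tau ** 1.5


def entropy(U, V, N):
    """sigma(U, V, N) for an ideal monatomic gas (Sackur-Tetrode form)."""
    tau = 2.0 * U / (3.0 * N)          # from U = (3/2) N tau
    return N * (math.log(n_quantum(tau) * V / N) + 2.5)


def energy(sigma, V, N):
    """U(sigma, V, N), obtained by solving entropy(U, V, N) = sigma for U."""
    tau = ((N / V) * math.exp(sigma / N - 2.5)) ** (2.0 / 3.0)
    return 1.5 * N * tau


def free_energy(tau, V, N):
    """F(tau, V, N) = U - tau * sigma = N tau [log(n / n_Q) - 1]."""
    return N * tau * (math.log(N / (V * n_quantum(tau))) - 1.0)


def d_dN(func, N, h=1e-4):
    """Central-difference estimate of d func / dN (N treated as continuous)."""
    return (func(N + h) - func(N - h)) / (2.0 * h)


# Illustrative state point with n << n_Q, so the classical ideal gas applies.
tau, V, N = 1.0, 1000.0, 10.0
U = 1.5 * N * tau
sigma = entropy(U, V, N)

mu_from_F = d_dN(lambda n: free_energy(tau, V, n), N)        # definition (5)
mu_from_sigma = -tau * d_dN(lambda n: entropy(U, V, n), N)   # relation (35)
mu_from_U = d_dN(lambda n: energy(sigma, V, n), N)           # relation (36)
mu_exact = tau * math.log((N / V) / n_quantum(tau))          # tau log(n / n_Q)

print(mu_from_F, mu_from_sigma, mu_from_U, mu_exact)
```

Each derivative holds a different pair of variables fixed, yet all three estimates agree with τ log(n/n_Q) to within the finite-difference error.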


Thermodynamic identity. We can generalize the statement of the thermodynamic
identity given in (3.34a) to include systems in which the number of
particles is allowed to change. As in (31),


$$d\sigma = \left(\frac{\partial\sigma}{\partial U}\right)_{V,N} dU + \left(\frac{\partial\sigma}{\partial V}\right)_{U,N} dV + \left(\frac{\partial\sigma}{\partial N}\right)_{U,V} dN . \tag{37}$$


By use of the definition (2.26) of 1/τ, the relation (3.32) for p/τ, and the relation
(30) for −μ/τ, we write dσ as


$$d\sigma = dU/\tau + p\,dV/\tau - \mu\,dN/\tau . \tag{38}$$

This may be rearranged to give


$$dU = \tau\,d\sigma - p\,dV + \mu\,dN , \tag{39}$$


which is a broader statement of the thermodynamic identity than we were able 
to develop in Chapter 3. 


GIBBS FACTOR AND GIBBS SUM 


The Boltzmann factor, derived in Chapter 3, allows us to give the ratio of the
probability that a system will be in a state of energy $\varepsilon_1$ to the probability the
system will be in a state of energy $\varepsilon_2$, for a system in thermal contact with a
reservoir at temperature τ:


$$\frac{P(\varepsilon_1)}{P(\varepsilon_2)} = \frac{\exp(-\varepsilon_1/\tau)}{\exp(-\varepsilon_2/\tau)} . \tag{40}$$

This is perhaps the best known result of statistical mechanics. The Gibbs factor
is the generalization of the Boltzmann factor to a system in thermal and diffusive
contact with a reservoir at temperature τ and chemical potential μ. The argument
retraces much of that presented in Chapter 3.

We consider a very large body with constant energy $U_0$ and constant particle
number $N_0$. The body is composed of two parts, the very large reservoir $\mathcal{R}$ and
the system $\mathcal{S}$, in thermal and diffusive contact (Figure 5.8). They may exchange
particles and energy. The contact assures that the temperature and the chemical
potential of the system are equal to those of the reservoir. When the system has
N particles, the reservoir has $N_0 - N$ particles; when the system has energy ε,
the reservoir has energy $U_0 - \varepsilon$. To obtain the statistical properties of the
system, we make observations as before on identical copies of the system +
reservoir, one copy for each accessible quantum state of the combination. What
is the probability in a given observation that the system will be found to contain
N particles and to be in a state s of energy $\varepsilon_s$?

The state s is a state of a system having some specified number of particles.
The energy $\varepsilon_{s(N)}$ is the energy of the state s of the N-particle system; sometimes
we write only $\varepsilon_s$, if the meaning is clear. When can we write the energy of a
system having N particles in an orbital as N times the energy of one particle
in the orbital? Only when interactions between the particles are neglected, so
that the particles may be treated as independent of each other.
Figure 5.8 A system in thermal and diffusive contact with
a large reservoir of energy and of particles. The total system
$\mathcal{R} + \mathcal{S}$ is insulated from the external world, so that the
total energy and the total number of particles are constant. 
The temperature of the system is equal to the temperature 
of the reservoir, and the chemical potential of the system is 
equal to the chemical potential of the reservoir. The system 
may be as small as one atom or it may be macroscopic, but 
the reservoir is always to be thought of as much larger than 
the system. 


Let $P(N,\varepsilon_s)$ denote the probability that the system has N particles and is in
a particular state s. This probability is proportional to the number of accessible
states of the reservoir when the state of the system is exactly specified. That
is, if we specify the state of $\mathcal{S}$, the number of accessible states of $\mathcal{R} + \mathcal{S}$ is just
the number of accessible states of $\mathcal{R}$:


$$g(\mathcal{R} + \mathcal{S}) = g(\mathcal{R}) \times 1 . \tag{41}$$


The factor 1 reminds us that we are looking at the system $\mathcal{S}$ in a single specified
state. The $g(\mathcal{R})$ states of the reservoir have $N_0 - N$ particles and have energy
$U_0 - \varepsilon_s$. Because the system probability $P(N,\varepsilon_s)$ is proportional to the number
of accessible states of the reservoir,


$$P(N,\varepsilon_s) \propto g(N_0 - N,\, U_0 - \varepsilon_s) . \tag{42}$$
Figure 5.9 The reservoir is in thermal and diffusive contact with the system. In (a) the
system is in quantum state 1, and the reservoir has $g(N_0 - N_1,\, U_0 - \varepsilon_1)$ states accessible
to it. In (b) the system is in quantum state 2, and the reservoir has $g(N_0 - N_2,\, U_0 - \varepsilon_2)$
states accessible to it. Because we have specified the exact state of the system, the total
number of states accessible to $\mathcal{R} + \mathcal{S}$ is just the number of states accessible to $\mathcal{R}$.


Here g refers to the reservoir alone and depends on the number of particles 
in the reservoir and on the energy of the reservoir. 

We can express (42) as a ratio of two probabilities, one that the system is in 
state 1 and the other that the system is in state 2: 


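$$\frac{P(N_1,\varepsilon_1)}{P(N_2,\varepsilon_2)} = \frac{g(N_0 - N_1,\, U_0 - \varepsilon_1)}{g(N_0 - N_2,\, U_0 - \varepsilon_2)} , \tag{43}$$


where g refers to the state of the reservoir. The situation is shown in Figure 5.9.
By definition of the entropy

$$g(N_0,U_0) \equiv \exp[\sigma(N_0,U_0)] , \tag{44}$$


so that the probability ratio in (43) may be written as


$$\frac{P(N_1,\varepsilon_1)}{P(N_2,\varepsilon_2)} = \frac{\exp[\sigma(N_0 - N_1,\, U_0 - \varepsilon_1)]}{\exp[\sigma(N_0 - N_2,\, U_0 - \varepsilon_2)]} , \tag{45}$$


or


$$\frac{P(N_1,\varepsilon_1)}{P(N_2,\varepsilon_2)} = \exp[\sigma(N_0 - N_1,\, U_0 - \varepsilon_1) - \sigma(N_0 - N_2,\, U_0 - \varepsilon_2)] = \exp(\Delta\sigma) . \tag{46}$$


Here, Δσ is the entropy difference:

$$\Delta\sigma \equiv \sigma(N_0 - N_1,\, U_0 - \varepsilon_1) - \sigma(N_0 - N_2,\, U_0 - \varepsilon_2) . \tag{47}$$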


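The reservoir is very large in comparison with the system, and Δσ may be
approximated quite accurately by the first order terms in a series expansion in
the two quantities N and ε that relate to the system. The entropy of the reservoir
becomes


$$\sigma(N_0 - N,\, U_0 - \varepsilon) = \sigma(N_0,U_0) - N\left(\frac{\partial\sigma}{\partial N_0}\right)_{U_0} - \varepsilon\left(\frac{\partial\sigma}{\partial U_0}\right)_{N_0} + \cdots . \tag{48}$$


For Δσ defined by (47) we have, to the first order in $N_1 - N_2$ and in $\varepsilon_1 - \varepsilon_2$,


$$\Delta\sigma = -(N_1 - N_2)\left(\frac{\partial\sigma}{\partial N_0}\right)_{U_0} - (\varepsilon_1 - \varepsilon_2)\left(\frac{\partial\sigma}{\partial U_0}\right)_{N_0} . \tag{49}$$

We know that

$$\frac{1}{\tau} \equiv \left(\frac{\partial\sigma}{\partial U_0}\right)_{N_0} , \tag{50a}$$


by our original definition of the temperature. This is written for the reservoir,
but the system will have the same temperature. Also,


$$-\frac{\mu}{\tau} \equiv \left(\frac{\partial\sigma}{\partial N_0}\right)_{U_0} , \tag{50b}$$


by (30).
The entropy difference (49) is


$$\Delta\sigma = \frac{(N_1 - N_2)\mu}{\tau} - \frac{(\varepsilon_1 - \varepsilon_2)}{\tau} . \tag{51}$$


Here Δσ refers to the reservoir, but $N_1$, $N_2$, $\varepsilon_1$, $\varepsilon_2$ refer to the system. The central
result of statistical mechanics is found on combining (46) and (51):


$$\frac{P(N_1,\varepsilon_1)}{P(N_2,\varepsilon_2)} = \frac{\exp[(N_1\mu - \varepsilon_1)/\tau]}{\exp[(N_2\mu - \varepsilon_2)/\tau]} . \tag{52}$$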
The probability is the ratio of two exponential factors, each of the form
exp[(Nμ − ε)/τ]. A term of this form is called a Gibbs factor. The Gibbs factor
is proportional to the probability that the system is in a state s of energy $\varepsilon_s$
and number of particles N. The result was first given by J. W. Gibbs, who
referred to it as the grand canonical distribution.
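
To make the ratio (52) concrete, here is a minimal Python sketch. The function names and the numerical values are illustrative assumptions, with all energies measured in units of τ. When the two states have the same N, the chemical potential cancels and the ratio reduces to the Boltzmann result (40).

```python
import math


def gibbs_factor(N, energy, mu, tau):
    """Gibbs factor exp[(N mu - energy) / tau]."""
    return math.exp((N * mu - energy) / tau)


def probability_ratio(state1, state2, mu, tau):
    """Ratio P(N1, e1) / P(N2, e2) from (52); each state is an (N, energy) pair."""
    (N1, e1), (N2, e2) = state1, state2
    return gibbs_factor(N1, e1, mu, tau) / gibbs_factor(N2, e2, mu, tau)


# Illustrative numbers (assumptions), energies in units of tau.
tau, mu = 1.0, -2.0
empty = (0, 0.0)       # no particle, energy zero
occupied = (1, -1.0)   # one particle bound with energy -1

ratio = probability_ratio(occupied, empty, mu, tau)   # exp[(mu + 1) / tau]

# Same particle number in both states: mu cancels, leaving a Boltzmann ratio.
boltzmann_like = probability_ratio((1, 0.5), (1, 1.5), mu, tau)

print(ratio, boltzmann_like)
```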

The sum of Gibbs factors, taken over all states of the system for all numbers 
of particles, is the normalizing factor that converts relative probabilities to 
absolute probabilities: 


$$\mathcal{Z}(\mu,\tau) = \sum_{N=0}^{\infty}\sum_{s(N)} \exp[(N\mu - \varepsilon_{s(N)})/\tau] = \sum_{\mathrm{ASN}} \exp[(N\mu - \varepsilon_{s(N)})/\tau] . \tag{53}$$


This is called the Gibbs sum, or the grand sum, or the grand partition function.
The sum is to be carried out over all states of the system for all numbers of
particles: this defines the abbreviation ASN. We have written $\varepsilon_s$ as $\varepsilon_{s(N)}$ to
emphasize the dependence of the state on the number of particles N. That is,
$\varepsilon_{s(N)}$ is the energy of the state s(N) of the exact N-particle hamiltonian. The
term N = 0 must be included; if we assign its energy as zero, then the first
term in $\mathcal{Z}$ will be 1.

The absolute probability that the system will be found in a state $N_1$, $\varepsilon_1$ is
given by the Gibbs factor divided by the Gibbs sum:


$$P(N_1,\varepsilon_1) = \frac{\exp[(N_1\mu - \varepsilon_1)/\tau]}{\mathcal{Z}} . \tag{54}$$


This applies to a system that is at temperature τ and chemical potential μ. The
ratio of any two P's is consistent with our central result (52) for the Gibbs
factors. Thus (52) gives the correct relative probabilities for the states $N_1$, $\varepsilon_1$
and $N_2$, $\varepsilon_2$. The sum of the probabilities of all states for all numbers of particles
of the system is unity:


$$\sum_N \sum_s P(N,\varepsilon_s) = \sum_{\mathrm{ASN}} P(N,\varepsilon_s) = \frac{\sum_{\mathrm{ASN}} \exp[(N\mu - \varepsilon_{s(N)})/\tau]}{\mathcal{Z}} = \frac{\mathcal{Z}}{\mathcal{Z}} = 1 , \tag{55}$$


by the definition of $\mathcal{Z}$. Thus (54) gives the correct absolute probability.*


* Readers interested in probability theory will find Appendix C on the Poisson distribution to be 
particularly helpful. The method used there to derive the Poisson distribution depends on the 
Gibbs sum. See also Problem (6.13). 
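
The Gibbs sum (53) and the absolute probabilities (54) can be sketched in a few lines of Python for a system with a small number of states. The three-state system below is hypothetical, chosen only for illustration, and the function names are ours.

```python
import math


def gibbs_sum(states, mu, tau):
    """Sum of Gibbs factors over all (N, energy) states, as in (53)."""
    return sum(math.exp((N * mu - e) / tau) for N, e in states)


def probabilities(states, mu, tau):
    """Absolute probability (54) of each state in `states`."""
    z = gibbs_sum(states, mu, tau)
    return [math.exp((N * mu - e) / tau) / z for N, e in states]


# Hypothetical system: empty (N = 0, energy 0), or one particle at energy 0 or 1.
states = [(0, 0.0), (1, 0.0), (1, 1.0)]
tau, mu = 1.0, -1.0

probs = probabilities(states, mu, tau)
total = sum(probs)   # normalization check, equation (55)
print(probs, total)
```

The N = 0 state contributes the term 1 to the sum, as noted above, and dividing by the Gibbs sum guarantees that the probabilities add to unity.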




Average values over the systems in diffusive and thermal contact with a 
reservoir are easily found. If X(N,s) is the value of X when the system has N 
particles and is in the quantum state s, then the thermal average of X over all 
N and all s is 


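$$\langle X\rangle = \sum_{\mathrm{ASN}} X(N,s)\,P(N,\varepsilon_s) = \frac{\sum_{\mathrm{ASN}} X(N,s)\exp[(N\mu - \varepsilon_s)/\tau]}{\mathcal{Z}} . \tag{56}$$


We shall use this result to calculate thermal averages.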


Number of particles. The number of particles in the system can vary because 
the system is in diffusive contact with a reservoir. The thermal average of the 
number of particles in the system is 


$$\langle N\rangle = \frac{\sum_{\mathrm{ASN}} N\exp[(N\mu - \varepsilon_s)/\tau]}{\mathcal{Z}} , \tag{57}$$


according to (56). To obtain the numerator, each term in the Gibbs sum has
been multiplied by the appropriate value of N. More convenient forms of ⟨N⟩
can be obtained from the definition of $\mathcal{Z}$:


$$\frac{\partial\mathcal{Z}}{\partial\mu} = \frac{1}{\tau}\sum_{\mathrm{ASN}} N\exp[(N\mu - \varepsilon_s)/\tau] , \tag{58}$$

whence

$$\langle N\rangle = \frac{\tau}{\mathcal{Z}}\frac{\partial\mathcal{Z}}{\partial\mu} = \tau\frac{\partial\log\mathcal{Z}}{\partial\mu} . \tag{59}$$


The thermal average number of particles is easily found from the Gibbs sum
$\mathcal{Z}$ by direct use of (59). When no confusion arises, we shall write N for the
thermal average ⟨N⟩. When we speak later of the occupancy of an orbital,
f or ⟨f⟩ will be written interchangeably for N or ⟨N⟩.
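
The agreement between the direct average (57) and the derivative form (59) can be checked numerically. The sketch below uses a hypothetical four-state system and a central-difference derivative; the names, the state list, and the step size are illustrative assumptions.

```python
import math


def gibbs_sum(states, mu, tau):
    """Gibbs sum (53) over a list of (N, energy) states."""
    return sum(math.exp((N * mu - e) / tau) for N, e in states)


def mean_N_direct(states, mu, tau):
    """Thermal average of N from the weighted sum (57)."""
    z = gibbs_sum(states, mu, tau)
    return sum(N * math.exp((N * mu - e) / tau) for N, e in states) / z


def mean_N_from_log_derivative(states, mu, tau, h=1e-5):
    """Thermal average of N from (59), tau * d(log Z)/d(mu), by central difference."""
    d_log_z = (math.log(gibbs_sum(states, mu + h, tau))
               - math.log(gibbs_sum(states, mu - h, tau))) / (2.0 * h)
    return tau * d_log_z


# Hypothetical states with N = 0, 1, or 2 particles.
states = [(0, 0.0), (1, 0.0), (1, 1.0), (2, 1.5)]
tau, mu = 0.7, -0.3

direct = mean_N_direct(states, mu, tau)
derivative = mean_N_from_log_derivative(states, mu, tau)
print(direct, derivative)
```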

We often employ the handy notation 


$$\lambda \equiv \exp(\mu/\tau) , \tag{60}$$


where λ is called the absolute activity. Here λ is the Greek letter lambda. We
see from (12) that for an ideal gas λ is directly proportional to the concentration.
The Gibbs sum is written as


$$\mathcal{Z} = \sum_N \sum_s \lambda^N \exp(-\varepsilon_s/\tau) = \sum_{\mathrm{ASN}} \lambda^N \exp(-\varepsilon_s/\tau) , \tag{61}$$


and the ensemble average number of particles (57) is


$$\langle N\rangle = \lambda\frac{\partial}{\partial\lambda}\log\mathcal{Z} . \tag{62}$$


This relation is useful, because in many actual problems we determine λ by
finding the value that will make ⟨N⟩ come out equal to the given number of
particles.
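
The procedure of choosing λ so that ⟨N⟩ matches a prescribed value can be sketched as a one-dimensional root search. The example below is an illustration under stated assumptions: it uses the form (61) of the Gibbs sum, relies on ⟨N⟩ increasing monotonically with λ, and applies bisection on log λ. The function names, the search bracket, and the two-state test system are ours.

```python
import math


def mean_N(states, lam, tau):
    """Thermal average of N using the Gibbs sum in the form (61)."""
    weights = [lam ** N * math.exp(-e / tau) for N, e in states]
    return sum(N * w for (N, _), w in zip(states, weights)) / sum(weights)


def solve_activity(states, target, tau, lo=1e-12, hi=1e12, iterations=200):
    """Find lambda with mean_N = target by bisection on log(lambda).

    Assumes the target lies between the values of mean_N at `lo` and `hi`.
    """
    for _ in range(iterations):
        mid = math.sqrt(lo * hi)          # midpoint on a logarithmic scale
        if mean_N(states, mid, tau) < target:
            lo = mid
        else:
            hi = mid
    return math.sqrt(lo * hi)


# Hypothetical site that is empty or holds one particle with energy -0.5.
states = [(0, 0.0), (1, -0.5)]
tau, target = 1.0, 0.9

lam = solve_activity(states, target, tau)
print(lam, mean_N(states, lam, tau))
```

For this single-site case the answer can be confirmed by hand: the occupancy is λe^{0.5}/(1 + λe^{0.5}), so an occupancy of 0.9 requires λ = 9e^{−0.5}.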


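Energy. The thermal average energy of the system is


$$U = \langle\varepsilon\rangle = \frac{\sum_{\mathrm{ASN}} \varepsilon_s \exp[\beta(N\mu - \varepsilon_s)]}{\mathcal{Z}} , \tag{63}$$


where we have temporarily introduced the notation β ≡ 1/τ. We shall usually
write U for ⟨ε⟩. Observe that


$$\langle N\mu - \varepsilon\rangle = \langle N\rangle\mu - U = \frac{1}{\mathcal{Z}}\frac{\partial\mathcal{Z}}{\partial\beta} = \frac{\partial}{\partial\beta}\log\mathcal{Z} , \tag{64}$$


so that (59) and (63) may be combined to give

$$U = \left[\frac{\mu}{\beta}\frac{\partial}{\partial\mu} - \frac{\partial}{\partial\beta}\right]\log\mathcal{Z} = \left[\tau\mu\frac{\partial}{\partial\mu} - \frac{\partial}{\partial(1/\tau)}\right]\log\mathcal{Z} . \tag{65}$$


A simpler expression that is more widely used in calculations was obtained in
Chapter 3 in terms of the partition function Z.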
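
Relation (65) can be verified numerically against the direct average (63). In the sketch below the μ-derivative is taken at fixed β and the β-derivative at fixed μ, both by central differences; the state list, parameter values, and function names are illustrative assumptions.

```python
import math


def log_gibbs_sum(states, mu, beta):
    """log of the Gibbs sum, with beta = 1/tau."""
    return math.log(sum(math.exp(beta * (N * mu - e)) for N, e in states))


def energy_direct(states, mu, beta):
    """Thermal average energy from the weighted sum (63)."""
    weights = [math.exp(beta * (N * mu - e)) for N, e in states]
    return sum(e * w for (_, e), w in zip(states, weights)) / sum(weights)


def energy_from_log_gibbs_sum(states, mu, beta, h=1e-5):
    """Thermal average energy from (65), using central differences."""
    d_mu = (log_gibbs_sum(states, mu + h, beta)
            - log_gibbs_sum(states, mu - h, beta)) / (2.0 * h)
    d_beta = (log_gibbs_sum(states, mu, beta + h)
              - log_gibbs_sum(states, mu, beta - h)) / (2.0 * h)
    return (mu / beta) * d_mu - d_beta


# Hypothetical states with N = 0, 1, or 2 particles.
states = [(0, 0.0), (1, 0.2), (1, 1.0), (2, 1.5)]
mu, beta = -0.4, 1.3

u_direct = energy_direct(states, mu, beta)
u_derivative = energy_from_log_gibbs_sum(states, mu, beta)
print(u_direct, u_derivative)
```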




Example: Occupancy zero or one. A red-blooded example of a system that may be
occupied by zero molecules or by one molecule is the heme group, which may be vacant or
may be occupied by one O₂ molecule, and never by more than one O₂ molecule (Figure
5.10). A single heme group occurs in the protein myoglobin, which is responsible for the
red color of meat. If ε is the energy of an adsorbed molecule of O₂ relative to O₂ at rest at
infinite distance, then the Gibbs sum is

$$\mathcal{Z} = 1 + \lambda\exp(-\varepsilon/\tau) . \tag{66}$$


If energy must be added to remove the O₂ molecule from the heme, ε will be negative. The term 1
in the sum arises from occupancy zero; the term λ exp(−ε/τ) arises from single occupancy.
These are the only possibilities. We have Mb + O₂ or MbO₂ present, where Mb denotes
myoglobin, a protein of molecular weight 17 000.

Figure 5.10 Adsorption of an O₂ by a heme,
where ε is the energy of an adsorbed O₂ relative
to an O₂ at infinite separation from the site.
If energy must be supplied to detach the O₂
from the heme, then ε will be negative.

Experimental results for the fractional occupancy versus the concentration of oxygen
are shown in Figure 5.11. We compare the observed oxygen saturation curves of myoglobin
and hemoglobin in Figure 5.12. Hemoglobin is the oxygen-carrying component of blood.
It is made up of four molecular strands, each strand nearly identical with the single strand of
myoglobin, and each capable of binding a single oxygen molecule. Historically, the classic
work on the adsorption of oxygen by hemoglobin was done by Christian Bohr, the father of
Niels Bohr. The oxygen saturation curve for hemoglobin (Hb) has a slower rise at low
pressures, because the binding energy of a single O₂ to a molecule of Hb is lower than for
Mb. At higher pressures of oxygen the Hb curve has a region that is concave upwards,
because the binding energy per O₂ increases after the first O₂ is adsorbed.

The O₂ molecules on hemes are in equilibrium with the O₂ in the surrounding liquid, so
that the chemical potentials of O₂ are equal on the myoglobin and in solution:


$$\mu(\mathrm{MbO_2}) = \mu(\mathrm{O_2}); \qquad \lambda(\mathrm{MbO_2}) = \lambda(\mathrm{O_2}) , \tag{67}$$


where λ ≡ exp(μ/τ). From Chapter 3 we find the value of λ in terms of the gas pressure
by the relation


$$\lambda = n/n_Q = p/\tau n_Q . \tag{68}$$

We assume the ideal gas result applies to O₂ in solution. At constant temperature λ(O₂) is
directly proportional to the pressure p.
The fraction f of Mb occupied by O₂ is found from (66) to be


$$f = \frac{\lambda\exp(-\varepsilon/\tau)}{1 + \lambda\exp(-\varepsilon/\tau)} = \frac{1}{\lambda^{-1}\exp(\varepsilon/\tau) + 1} , \tag{69}$$


Figure 5.11 The reaction of a myoglobin
(Mb) molecule with oxygen may be viewed as
the adsorption of a molecule of O₂ at a site
on the large myoglobin molecule. The
results follow a Langmuir isotherm quite
accurately. Each myoglobin molecule can
adsorb one O₂ molecule. These curves show
the fraction of myoglobin with adsorbed O₂
as a function of the partial pressure of O₂.
The curves are for human myoglobin in
solution. Myoglobin is found in muscles; it is
responsible for the color of steak. After A.
Rossi-Fanelli and E. Antonini, Archives of
Biochemistry and Biophysics 77, 478 (1958).


Figure 5.12 Saturation curves of O₂ bound
to myoglobin (Mb) and hemoglobin (Hb)
molecules in solution in water. The partial
pressure of O₂ is plotted as the horizontal
axis. The vertical axis gives the fraction of
the molecules of Mb which has one bound
O₂ molecule, or the fraction of the strands of
Hb which have one bound O₂ molecule.
Hemoglobin has a much larger change in
oxygen content in the pressure range between
the arteries and the veins. This circumstance
facilitates the action of the heart, viewed as a
pump. The curve for myoglobin has the
predicted form for the reaction Mb + O₂ ⇌
MbO₂. The curve for hemoglobin has a
different form because of interactions between
O₂ molecules bound to the four strands of the
Hb molecule. The drawing is after J. S. Fruton
and S. Simmonds, General biochemistry, Wiley,
1961.
which is the same as the Fermi-Dirac distribution function derived in Chapter 7. We
substitute (68) in (69) to obtain


$$f = \frac{1}{(n_Q\tau/p)\exp(\varepsilon/\tau) + 1} = \frac{p}{n_Q\tau\exp(\varepsilon/\tau) + p} , \tag{70}$$


or, with $p_0 \equiv n_Q\tau\exp(\varepsilon/\tau)$,


$$f = \frac{p}{p_0 + p} , \tag{71}$$


where $p_0$ is constant with respect to pressure, but depends on the temperature. The result
(71) is known as the Langmuir adsorption isotherm when used to describe the adsorption of
gases on the surfaces of solids.
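
A short Python sketch shows that the occupancy (69), with λ taken from (68), coincides with the Langmuir form (71). The numerical values are illustrative assumptions (units with τ = 1 and n_Q = 1, and a binding energy ε = −2τ), not data for myoglobin, and the function names are ours.

```python
import math


def occupancy_from_activity(lam, eps, tau):
    """Fractional occupancy (69): f = 1 / (exp(eps/tau)/lam + 1)."""
    return 1.0 / (math.exp(eps / tau) / lam + 1.0)


def langmuir_occupancy(p, p0):
    """Langmuir isotherm (71): f = p / (p0 + p)."""
    return p / (p0 + p)


# Illustrative values (assumptions): units with tau = 1 and n_Q = 1.
tau, n_q, eps = 1.0, 1.0, -2.0
p0 = n_q * tau * math.exp(eps / tau)   # definition of p0 below (70)

pressures = [0.1 * p0, p0, 10.0 * p0]
from_69 = [occupancy_from_activity(p / (tau * n_q), eps, tau) for p in pressures]
from_71 = [langmuir_occupancy(p, p0) for p in pressures]

for p, f in zip(pressures, from_71):
    print(f"p/p0 = {p / p0:.1f}  f = {f:.4f}")
```

The pressure p₀ is the pressure at which half the sites are occupied, which gives a direct way to read it off a measured saturation curve.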




Example: Impurity atom ionization in a semiconductor. Atoms of numerous chemical
elements when present as impurities in a semiconductor may lose an electron by ionization
to the conduction band of the semiconductor crystal. In the conduction band the electron
moves about much as if it were a free particle, and the electron gas in the conduction band
may often be treated as an ideal gas. The impurity atoms are small systems $\mathcal{S}$ in thermal
and diffusive equilibrium with the large reservoir formed by the rest of the semiconductor:
the atoms exchange electrons and energy with the semiconductor.

Let I be the ionization energy of the impurity atom. We suppose that one, but only one,
electron can be bound to an impurity atom; either orientation ↑ or ↓ of the electron spin
is accessible. Therefore the system $\mathcal{S}$ has three allowed states: one without an electron,
one with an electron attached with spin ↑, and one with an electron attached with spin ↓.
When $\mathcal{S}$ has zero electrons, the impurity atom is ionized. We choose the zero of energy of $\mathcal{S}$
as this state; the other two states therefore have the common energy ε = −I. The accessible
states of $\mathcal{S}$ are summarized below.


State number   Description                 N    ε
1              Electron detached           0    0
2              Electron attached, spin ↑   1    −I
3              Electron attached, spin ↓   1    −I


The Gibbs sum is given by


$$\mathcal{Z} = 1 + 2\exp[(\mu + I)/\tau] . \tag{72}$$

The probability that $\mathcal{S}$ is ionized (N = 0) is


$$P(\text{ionized}) = P(0,0) = \frac{1}{\mathcal{Z}} = \frac{1}{1 + 2\exp[(\mu + I)/\tau]} . \tag{73}$$

The probability that $\mathcal{S}$ is neutral (un-ionized) is

$$P(\text{neutral}) = P(1{\uparrow},-I) + P(1{\downarrow},-I) , \tag{74}$$


which is just 1 − P(0,0).
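
The ionized and neutral fractions (73) and (74) are simple to evaluate. The Python sketch below uses illustrative numbers in eV that are assumptions, not values from the text, with μ measured on the same energy scale as the impurity states; the function names are ours.

```python
import math


def ionized_fraction(mu, ionization_energy, tau):
    """P(ionized) from (73)."""
    return 1.0 / (1.0 + 2.0 * math.exp((mu + ionization_energy) / tau))


def neutral_fraction(mu, ionization_energy, tau):
    """P(neutral) from (74): two Gibbs factors exp[(mu + I)/tau] over the Gibbs sum (72)."""
    factor = math.exp((mu + ionization_energy) / tau)
    return 2.0 * factor / (1.0 + 2.0 * factor)


# Illustrative numbers in eV (assumptions, not taken from the text).
tau = 0.0259                 # roughly room temperature in energy units
ionization_energy = 0.045
mu = -0.20

p_ion = ionized_fraction(mu, ionization_energy, tau)
p_neutral = neutral_fraction(mu, ionization_energy, tau)
print(p_ion, p_neutral, p_ion + p_neutral)
```

When μ = −I the three states carry equal weight, so the ionized fraction is 1/3. The factor 2 from the spin degeneracy is what distinguishes this result from the single-occupancy form (69).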


SUMMARY 


1. The chemical potential is defined as $\mu(\tau,V,N) \equiv (\partial F/\partial N)_{\tau,V}$ and may also be
found from $\mu = (\partial U/\partial N)_{\sigma,V} = -\tau(\partial\sigma/\partial N)_{U,V}$. Two systems are in diffusive
equilibrium if $\mu_1 = \mu_2$.


2. The chemical potential is made up of two parts, external and internal. The
external part is the potential energy of a particle in an external field of force.
The internal part is of thermal origin; for an ideal monatomic gas μ(int) =
τ log($n/n_Q$), where n is the concentration and $n_Q \equiv (M\tau/2\pi\hbar^2)^{3/2}$ is the quantum
concentration.


3. The Gibbs factor

$$P(N,\varepsilon_s) = \exp[(N\mu - \varepsilon_s)/\tau]/\mathcal{Z}$$

gives the probability that a system at chemical potential μ and temperature τ
will have N particles and be in a quantum state s of energy $\varepsilon_s$.


4. The Gibbs sum

$$\mathcal{Z} \equiv \sum_{\mathrm{ASN}} \exp[(N\mu - \varepsilon_{s(N)})/\tau]$$

is taken over all states for all numbers of particles.


5. The absolute activity λ is defined by λ ≡ exp(μ/τ).


6. The thermal average number of particles is

$$\langle N\rangle = \lambda\frac{\partial}{\partial\lambda}\log\mathcal{Z} .$$




PROBLEMS 


1. Centrifuge. A circular cylinder of radius R rotates about the long axis with
angular velocity ω. The cylinder contains an ideal gas of atoms of mass M at
temperature τ. Find an expression for the dependence of the concentration
n(r) on the radial distance r from the axis, in terms of n(0) on the axis. Take
μ as for an ideal gas.


2. Molecules in the Earth’s atmosphere. If n is the concentration of molecules
at the surface of the Earth, M the mass of a molecule, and g the gravitational
acceleration at the surface, show that at constant temperature the total number
of molecules in the atmosphere is


$$N = 4\pi n(R)\exp(-MgR/\tau)\int_R^\infty dr\, r^2 \exp\!\left(MgR^2/r\tau\right) , \tag{75}$$


with r measured from the center of the Earth; here R is the radius of the Earth. 
The integral diverges at the upper limit, so that N cannot be bounded and the 
atmosphere cannot be in equilibrium. Molecules, particularly light molecules, 
are always escaping from the atmosphere. 


3. Potential energy of gas in gravitational field. Consider a column of atoms 
each of mass M at temperature t in a uniform gravitational field g. Find the 
thermal average potential energy per atom. The thermal average kinetic energy 
density is independent of height. Find the total heat capacity per atom. The 
total heat capacity is the sum of contributions from the kinetic energy and from 
the potential energy. Take the zero of the gravitational energy at the bottom 
h = 0 of the column. Integrate from h = 0 to h = ∞.


4. Active transport. The concentration of potassium K⁺ ions in the internal
sap of a plant cell (for example, a fresh water alga) may exceed by a factor of 10⁴
the concentration of K⁺ ions in the pond water in which the cell is growing.
The chemical potential of the K⁺ ions is higher in the sap because their concentration
n is higher there. Estimate the difference in chemical potential at
300 K and show that it is equivalent to a voltage of 0.24 V across the cell wall.
Take μ as for an ideal gas. Because the values of the chemical potentials are
different, the ions in the cell and in the pond are not in diffusive equilibrium. The 
plant cell membrane is highly impermeable to the passive leakage of ions 
through it. Important questions in cell physics include these: How is the high 
concentration of ions built up within the cell? How is metabolic energy applied 
to energize the active ion transport? 


5. Magnetic concentration. Determine the ratio m/τ for which Figure 5.6 is
drawn. If T = 300 K, how many Bohr magnetons $\mu_B \equiv e\hbar/2mc$ would the
particles contain to give a magnetic concentration effect of the magnitude
shown?
6. Gibbs sum for a two level system. (a) Consider a system that may be unoccupied
with energy zero or occupied by one particle in either of two states,
one of energy zero and one of energy ε. Show that the Gibbs sum for this system is


$$\mathcal{Z} = 1 + \lambda + \lambda\exp(-\varepsilon/\tau) . \tag{76}$$

Our assumption excludes the possibility of one particle in each state at the same
time. Notice that we include in the sum a term for N = 0 as a particular state of a
system of a variable number of particles.


(b) Show that the thermal average occupancy of the system is


$$\langle N\rangle = \frac{\lambda + \lambda\exp(-\varepsilon/\tau)}{\mathcal{Z}} . \tag{77}$$

(c) Show that the thermal average occupancy of the state at energy ε is

$$\langle N(\varepsilon)\rangle = \lambda\exp(-\varepsilon/\tau)/\mathcal{Z} . \tag{78}$$


(d) Find an expression for the thermal average energy of the system.
(e) Allow the possibility that the orbital at 0 and at ε may be occupied each by
one particle at the same time; show that


$$\mathcal{Z} = 1 + \lambda + \lambda\exp(-\varepsilon/\tau) + \lambda^2\exp(-\varepsilon/\tau) = (1 + \lambda)[1 + \lambda\exp(-\varepsilon/\tau)] . \tag{79}$$


Because $\mathcal{Z}$ can be factored as shown, we have in effect two independent systems.


7. States of positive and negative ionization. Consider a lattice of fixed hydrogen
atoms; suppose that each atom can exist in four states:


State          Number of electrons   Energy
Ground         1                     −½Δ
Positive ion   0                     −½δ
Negative ion   2                     ½δ
Excited        1                     ½Δ


Find the condition that the average number of electrons per atom be unity.
The condition will involve δ, λ, and τ.


8. Carbon monoxide poisoning. In carbon monoxide poisoning the CO
replaces the O₂ adsorbed on hemoglobin (Hb) molecules in the blood. To show
the effect, consider a model for which each adsorption site on a heme may be
vacant or may be occupied either with energy $\varepsilon_A$ by one molecule O₂ or with
energy $\varepsilon_B$ by one molecule CO. Let N fixed heme sites be in equilibrium with
O₂ and CO in the gas phases at concentrations such that the activities are
λ(O₂) = 1 × 10⁻⁵ and λ(CO) = 1 × 10⁻⁷, all at body temperature 37°C.
Neglect any spin multiplicity factors. (a) First consider the system in the absence
of CO. Evaluate $\varepsilon_A$ such that 90 percent of the Hb sites are occupied by O₂.
Express the answer in eV per O₂. (b) Now admit the CO under the specified
conditions. Find $\varepsilon_B$ such that only 10 percent of the Hb sites are occupied by O₂.


9. Adsorption of O₂ in a magnetic field. Suppose that at most one O₂ can be
bound to a heme group (see Problem 8), and that when λ(O₂) = 10⁻⁵ we have
90 percent of the hemes occupied by O₂. Consider O₂ as having a spin of 1
and a magnetic moment of 1 $\mu_B$. How strong a magnetic field is needed to change
the adsorption by 1 percent at T = 300 K? (The Gibbs sum in the limit of zero
magnetic field will differ from that of Problem 8 because there the spin multiplicity
of the bound state was neglected.)


10. Concentration fluctuations. The number of particles is not constant in a
system in diffusive contact with a reservoir. We have seen that


$$\langle N\rangle = \frac{\tau}{\mathcal{Z}}\left(\frac{\partial\mathcal{Z}}{\partial\mu}\right)_{\tau,V} , \tag{80}$$

from (59). (a) Show that

$$\langle N^2\rangle = \frac{\tau^2}{\mathcal{Z}}\frac{\partial^2\mathcal{Z}}{\partial\mu^2} . \tag{81}$$


The mean-square deviation ⟨(ΔN)²⟩ of N from ⟨N⟩ is defined by


$$\langle(\Delta N)^2\rangle = \langle(N - \langle N\rangle)^2\rangle = \langle N^2\rangle - 2\langle N\rangle\langle N\rangle + \langle N\rangle^2 = \langle N^2\rangle - \langle N\rangle^2 ;$$

$$\langle(\Delta N)^2\rangle = \tau^2\left[\frac{1}{\mathcal{Z}}\frac{\partial^2\mathcal{Z}}{\partial\mu^2} - \frac{1}{\mathcal{Z}^2}\left(\frac{\partial\mathcal{Z}}{\partial\mu}\right)^2\right] . \tag{82}$$


(b) Show that this may be written as


$$\langle(\Delta N)^2\rangle = \tau\,\partial\langle N\rangle/\partial\mu . \tag{83}$$

In Chapter 6 we apply this result to the ideal gas to find that


$$\frac{\langle(\Delta N)^2\rangle}{\langle N\rangle^2} = \frac{1}{\langle N\rangle} \tag{84}$$


is the mean square fractional fluctuation in the population of an ideal gas in
diffusive contact with a reservoir. If ⟨N⟩ is of the order of 10²⁰ atoms, then the
fractional fluctuation is exceedingly small. In such a system the number of
particles is well defined even though it cannot be rigorously constant because
diffusive contact is allowed with the reservoir. When ⟨N⟩ is low, this relation
can be used in the experimental determination of the molecular weight of large
molecules such as DNA of MW 10⁸ to 10¹⁰; see M. Weissman, H. Schindler,
and G. Feher, Proc. Nat. Acad. Sci. 73, 2776 (1976).


11. Equivalent definition of chemical potential. The chemical potential was
defined by (5) as $(\partial F/\partial N)_{\tau,V}$. An equivalent expression listed in Table 5.1 is


$$\mu = (\partial U/\partial N)_{\sigma,V} . \tag{85}$$


Prove that this relation, which was used by Gibbs to define μ, is equivalent to
the definition (5) that we have adopted. It will be convenient to make use of the
results (31) and (35). Our reasons for treating (5) as the definition of μ, and (85) as
a mathematical consequence, are two-fold. In practice, we need the chemical
potential more often as a function of the temperature τ than as a function of the
entropy σ. Operationally, a process in which a particle is added to a system
while the temperature of the system is kept constant is a more natural process
than one in which the entropy is kept constant: Adding a particle to a system at
a finite temperature tends to increase its entropy unless we can keep each system
of the ensemble in a definite, although new, quantum state. There is no natural
laboratory process by which this can be done. Hence the definition (5) or (6),
in which the chemical potential is expressed as the change in free energy per
added particle under conditions of constant temperature, is operationally the
simpler. We point out that (85) will not give U = μN on integration, because
μ(N,σ,V) is a function of N; compare with (9.13).


12. Ascent of sap in trees. Find the maximum height to which water may rise
in a tree under the assumption that the roots stand in a pool of water and the
uppermost leaves are in air containing water vapor at a relative humidity r =
0.9. The temperature is 25°C. If the relative humidity is r, the actual concentration
of water vapor in the air at the uppermost leaves is $r n_0$, where $n_0$ is the
concentration in the saturated air that stands immediately above the pool of
water.


13. Isentropic expansion. (a) Show that the entropy of an ideal gas can be
expressed as a function only of the orbital occupancies. (b) From this result
show that $\tau V^{2/3}$ is constant in an isentropic expansion of an ideal monatomic
gas.


14. Multiple binding of O$_2$. A hemoglobin molecule can bind four O$_2$
molecules. Assume that $\varepsilon$ is the energy of each bound O$_2$, relative to O$_2$ at rest at
infinite distance. Let $\lambda$ denote the absolute activity $\exp(\mu/\tau)$ of the free O$_2$ (in
solution). (a) What is the probability that one and only one O$_2$ is adsorbed on a
hemoglobin molecule? Sketch the result qualitatively as a function of $\lambda$. (b) What
is the probability that four and only four O$_2$ are adsorbed? Sketch this result
also.


15. External chemical potential. Consider a system at temperature $\tau$, with
$N$ atoms of mass $M$ in volume $V$. Let $\mu(0)$ denote the value of the chemical
potential at the surface of the earth. (a) Prove carefully and honestly that the
value of the total chemical potential for the identical system when translated
to altitude $h$ is

$$\mu(h) = \mu(0) + Mgh,$$

where $g$ is the acceleration of gravity. (b) Why is this result different from that
applicable to the barometric equation of an isothermal atmosphere?

Chapter 6: Ideal Gas


The ideal gas is a gas of noninteracting atoms in the limit of low concentration. 
The limit is defined below in terms of the thermal average value of the number 
of particles that occupy an orbital. The thermal average occupancy is called the
distribution function, usually designated as $f(\varepsilon,\tau,\mu)$, where $\varepsilon$ is the energy of
the orbital.

An orbital is a state of the Schrödinger equation for only one particle. This
term is widely used, particularly by chemists. If the interactions between particles
are weak, the orbital model allows us to approximate an exact quantum state
of the Schrödinger equation of a system of $N$ particles in terms of an approximate
quantum state that we construct by assigning the $N$ particles to orbitals,
with each orbital a solution of a one-particle Schrödinger equation. There are
usually an infinite number of orbitals available for occupancy. The term
“orbital” is used even when there is no analogy to a classical orbit or to a
Bohr orbit. The orbital model gives an exact solution of the $N$-particle problem
only if there are no interactions between the particles.

It is a fundamental result of quantum mechanics (the derivation of which 
would lead us astray here) that all species of particles fall into two distinct 
classes, fermions and bosons. Any particle with half-integral spin is a fermion, 
and any particle with zero or integral spin is a boson. There are no intermediate
classes. Composite particles follow the same rule: an atom of $^3$He is
composed of an odd number of particles (2 electrons, 2 protons, 1 neutron),
each of spin $\tfrac{1}{2}$, so that $^3$He must have half-integral spin and must be a fermion.
An atom of $^4$He has one more neutron, so there are an even number of particles
of spin $\tfrac{1}{2}$, and $^4$He must be a boson.

The fermion or boson nature of the particle species that make up a many- 
body system has a profound and important effect on the states of the system. 
The results of quantum theory as applied to the orbital model of noninteracting 
particles appear as occupancy rules: 


1. An orbital can be occupied by any integral number of bosons of the same 
species, including zero. 
2. An orbital can be occupied by 0 or 1 fermion of the same species. 


The second rule is a statement of the Pauli exclusion principle. Thermal averages 
of occupancies need not be integral or half-integral, but the orbital occupancies 
of any individual system must conform to one or the other rule. 




The two different occupancy rules give rise to two different Gibbs sums for 
each orbital: there is a boson sum over all integral values of the orbital occu- 
pancy N, and there is a fermion sum in which N = 0 or N = 1 only. Different 
Gibbs sums lead to different quantum distribution functions $f(\varepsilon,\tau,\mu)$ for the
thermal average occupancy. If conditions are such that $f \ll 1$, it will not matter
whether the occupancies $N = 2, 3, \ldots$ are excluded or are allowed. Thus when
$f \ll 1$ the fermion and boson distribution functions must be similar. This limit
in which the orbital occupancy is small in comparison with unity is called the 
classical regime. 

We now treat the Fermi-Dirac distribution function for the thermal average
occupancy of an orbital by fermions and the Bose-Einstein distribution function 
for the thermal average occupancy of an orbital by bosons. We show the 
equivalence of the two functions in the limit of low occupancy, and we go on 
to treat the properties of a gas in this limit. In Chapter 7 we treat the properties 
of fermion and boson gases in the opposite limit, where the nature of the 
particles is absolutely crucial for the properties of the gas. 


FERMI-DIRAC DISTRIBUTION FUNCTION 


We consider a system composed of a single orbital that may be occupied by a 
fermion. The system is placed in thermal and diffusive contact with a reservoir, 
as in Figures 6.1 and 6.2. A real system may consist of a large number $N_0$ of
fermions, but it is very helpful to focus on one orbital and call it the system.
All other orbitals of the real system are thought of as the reservoir. Our problem
is to find the thermal average occupancy of the orbital thus singled out. An
orbital can be occupied by zero or by one fermion. No other occupancy is
allowed by the Pauli exclusion principle. The energy of the system will be taken
to be zero if the orbital is unoccupied. The energy is $\varepsilon$ if the orbital is occupied
by one fermion.

The Gibbs sum now is simple: from the definition in Chapter 5 we have

$$\mathcal{Z} = 1 + \lambda\exp(-\varepsilon/\tau). \tag{1}$$

The term 1 comes from the configuration with occupancy $N = 0$ and energy
$\varepsilon = 0$. The term $\lambda\exp(-\varepsilon/\tau)$ comes when the orbital is occupied by one fermion,
so that $N = 1$ and the energy is $\varepsilon$. The thermal average value of the occupancy
of the orbital is the ratio of the term in the Gibbs sum with $N = 1$ to the entire
Gibbs sum:

$$\langle N(\varepsilon)\rangle = \frac{\lambda\exp(-\varepsilon/\tau)}{1 + \lambda\exp(-\varepsilon/\tau)} = \frac{1}{\lambda^{-1}\exp(\varepsilon/\tau) + 1}. \tag{2}$$

[Figure 6.1 diagram, two panels. Unoccupied: the system has 0 fermions and 0 energy;
the reservoir has $N_0$ fermions, energy $U_0$, multiplicity $g(N_0,U_0)$, and entropy
$\sigma(N_0,U_0) = \log g(N_0,U_0)$. Occupied: the system has 1 fermion and energy $\varepsilon$;
the reservoir has $N_0 - 1$ fermions, energy $U_0 - \varepsilon$, multiplicity $g(N_0 - 1, U_0 - \varepsilon)$, and entropy
$\sigma(N_0 - 1, U_0 - \varepsilon) = \sigma(N_0,U_0) - (\partial\sigma/\partial N_0) - \varepsilon(\partial\sigma/\partial U_0) = \sigma(N_0,U_0) + \mu/\tau - \varepsilon/\tau$.]

Figure 6.1 We consider as the system a single orbital that may be occupied at most by one
fermion. The system is in thermal and diffusive contact with the reservoir at temperature $\tau$. The
energy $\varepsilon$ of the occupied orbital might be the kinetic energy of a free electron of a definite
spin orientation and confined to a fixed volume. Other allowed quantum states may be
considered as forming the reservoir. The reservoir will contain $N_0$ fermions if the system is
unoccupied and $N_0 - 1$ fermions if the system is occupied by one fermion.


We introduce for the average occupancy the conventional symbol $f(\varepsilon)$ that
denotes the thermal average number of particles in an orbital of energy $\varepsilon$:

$$f(\varepsilon) = \langle N(\varepsilon)\rangle. \tag{3}$$

Recall from Chapter 5 that $\lambda = \exp(\mu/\tau)$, where $\mu$ is the chemical potential.
We may write (2) in the standard form

$$f(\varepsilon) = \frac{1}{\exp[(\varepsilon - \mu)/\tau] + 1}. \tag{4}$$


This result is known as the Fermi-Dirac distribution function.* Equation (4)
gives the average number of fermions in a single orbital of energy $\varepsilon$. The value

* This distribution function was discovered independently by E. Fermi, Zeitschrift für Physik 36,
902 (1926), and P. A. M. Dirac, Proceedings of the Royal Society of London A112, 661 (1926).
Both workers drew on Pauli’s paper of the preceding year in which the exclusion principle was
discovered. The paper by Dirac is concerned with the new quantum mechanics and contains a
general statement of the form assumed by the Pauli principle on this theory.




[Figure 6.2 diagram: (a) all orbitals together form the system, with total energy $\varepsilon_{tot}$;
(b) a single orbital is the system and all other orbitals form the reservoir.]

Figure 6.2 (a) The obvious method of viewing a system of noninteracting particles is
shown here. The energy levels each refer to an orbital that is a solution of a single-particle
Schrödinger equation. The total energy of the system is

$$\varepsilon_{tot} = \sum_n N_n\varepsilon_n,$$

where $N_n$ is the number of particles in the orbital $n$ of energy $\varepsilon_n$. For fermions $N_n = 0$
or 1. (b) It is much simpler than (a), and equally valid, to treat a single orbital as the
system. The system in this scheme may be the orbital $n$ of energy $\varepsilon_n$. All other orbitals
are viewed as the reservoir. The total energy of this one-orbital system is $N_n\varepsilon_n$, where
$N_n$ is the number of particles in the orbital. This device of using one orbital as the
system works because the particles are supposed to interact only weakly with each
other. If we think of the fermion system associated with the orbital $n$, there are two
possibilities: either the system has 0 particles and energy 0, or the system has 1 particle
and energy $\varepsilon_n$. Thus, the Gibbs sum consists of only two terms:

$$\mathcal{Z} = 1 + \lambda\exp(-\varepsilon_n/\tau).$$

The first term arises from the orbital occupancy $N_n = 0$, and the second term arises
from $N_n = 1$.


of $f$ always lies between zero and one. The Fermi-Dirac distribution function
is plotted in Figure 6.3.

In the field of solid state physics the chemical potential $\mu$ is often called the
Fermi level. The chemical potential usually depends on the temperature. The
value of $\mu$ at zero temperature is often written as $\varepsilon_F$; that is,

$$\mu(\tau = 0) \equiv \mu(0) \equiv \varepsilon_F. \tag{5}$$

We call $\varepsilon_F$ the Fermi energy, not to be confused* with the Fermi level which

* In the semiconductor literature the symbol $\varepsilon_F$ is often used for $\mu$ at any temperature, and $\varepsilon_F$ is
then called the Fermi level.

Figure 6.3 Plot of the Fermi-Dirac distribution function $f(\varepsilon)$ versus $\varepsilon - \mu$ in units of
the temperature $\tau$. The value of $f(\varepsilon)$ gives the fraction of orbitals at a given energy
which are occupied when the system is in thermal equilibrium. When the system is
heated from absolute zero, fermions are transferred from the shaded region at $\varepsilon/\mu < 1$
to the shaded region at $\varepsilon/\mu > 1$. For conduction electrons in a metal, $\mu$ might
correspond to 50 000 K.


is the temperature dependent $\mu(\tau)$. Consider a system of many independent
orbitals, as in Figure 6.4. At the temperature $\tau = 0$, all orbitals of energy below
the Fermi energy are occupied by exactly one fermion each, and all orbitals
of higher energy are unoccupied. At nonzero temperatures the value of the
chemical potential $\mu$ departs from the Fermi energy, as we will see in Chapter 7.

If there is an orbital of energy equal to the chemical potential ($\varepsilon = \mu$), the
orbital is exactly half-filled, in the sense of a thermal average:

$$f(\varepsilon = \mu) = \frac{1}{1 + 1} = \frac{1}{2}. \tag{6}$$


Orbitals of lower energy are more than half-filled, and orbitals of higher energy 
are less than half-filled. 
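
The half-filling property is easy to check numerically. The following is a minimal sketch, not part of the original text: the function name `fermi_dirac` and the sample values of the chemical potential and temperature are assumptions chosen only for illustration, with all energies in the same arbitrary units.

```python
import math

def fermi_dirac(eps, mu, tau):
    """Thermal average occupancy of one orbital of energy eps for fermions."""
    return 1.0 / (math.exp((eps - mu) / tau) + 1.0)

# Illustrative values (assumed): energies in units where mu = 1, with tau = 0.05.
mu, tau = 1.0, 0.05
for eps in (0.80, 0.95, 1.00, 1.05, 1.20):
    print(f"eps = {eps:.2f}   f = {fermi_dirac(eps, mu, tau):.4f}")
```

The occupancy at the chemical potential comes out as one half, and the occupancies at equal distances above and below the chemical potential add up to one.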

We shall discuss the physical consequences of the Fermi-Dirac distribution 
in Chapter 7. Right now we go on to discuss the distribution function of
noninteracting bosons, and then we establish the ideal gas law for both fermions
and bosons in the appropriate limit.


BOSE-EINSTEIN DISTRIBUTION FUNCTION 


A boson is a particle with an integral value of the spin. The occupancy rule
for bosons is that an orbital can be occupied by any number of bosons, so that
bosons have an essentially different quality than fermions. Systems of bosons
can have rather different physical properties than systems of fermions. Atoms
of $^4$He are bosons; atoms of $^3$He are fermions. The remarkable superfluid
properties of the low temperature ($T < 2.17$ K) phase of liquid helium can be
attributed to the properties of a boson gas. There is a sudden increase in the
fluidity and in the heat conductivity of liquid $^4$He below this temperature. In
experiments by Kapitza the flow viscosity of $^4$He below 2.17 K was found to
be less than $10^{-7}$ of the viscosity of the liquid above 2.17 K.

Photons (the quanta of the electromagnetic field) and phonons (the quanta 
of elastic waves in solids) can be considered to be bosons whose number is 
not conserved, but it is simpler to think of photons and phonons as excitations 
of an oscillator, as we did in Chapter 4. 

We consider the distribution function for a system of noninteracting bosons 
in thermal and diffusive contact with a reservoir. We assume the bosons are 
all of the same species. Let $\varepsilon$ denote the energy of a single orbital when occupied
by one particle; when there are $N$ particles in the orbital, the energy is $N\varepsilon$,
as in Figure 6.5. We treat one orbital as the system and view all other orbitals

Figure 6.4 A convenient pictorial way to
think of a system composed of independent
orbitals that do not interact with each other,
but interact with a common reservoir.

Figure 6.5 Energy-level scheme for noninteracting
bosons. Here $\varepsilon$ is the energy of an orbital when occupied by one particle;
$N\varepsilon$ is the energy of the same orbital when occupied by $N$ particles. Any number of bosons
can occupy the same orbital. The lowest level of this orbital contributes a term 1 to the
grand sum; the next highest level contributes $\lambda\exp(-\varepsilon/\tau)$; and the subsequent contributions
are $\lambda^2\exp(-2\varepsilon/\tau)$; $\lambda^3\exp(-3\varepsilon/\tau)$; $\lambda^4\exp(-4\varepsilon/\tau)$;
and so on. The Gibbs sum is $\mathcal{Z} = 1 + \lambda\exp(-\varepsilon/\tau) + \lambda^2\exp(-2\varepsilon/\tau) + \cdots$.
(Levels shown in the diagram: $N = 0, 1, 2, 3, 4$ at energies $0, \varepsilon, 2\varepsilon, 3\varepsilon, 4\varepsilon$.)


as part of the reservoir. Any arbitrary number of particles may be in the orbital.
The Gibbs sum taken for the orbital is

$$\mathcal{Z} = \sum_{N=0}^{\infty} \lambda^N\exp(-N\varepsilon/\tau) = \sum_{N=0}^{\infty} [\lambda\exp(-\varepsilon/\tau)]^N. \tag{7}$$

The upper limit on $N$ should be the total number of particles in the combined
system and reservoir. However, the reservoir may be arbitrarily large, so that
$N$ may run from zero to infinity. The series (7) may be summed in closed form.
Let $x = \lambda\exp(-\varepsilon/\tau)$; then

$$\mathcal{Z} = \sum_{N=0}^{\infty} x^N = \frac{1}{1 - x} = \frac{1}{1 - \lambda\exp(-\varepsilon/\tau)}, \tag{8}$$

provided that $\lambda\exp(-\varepsilon/\tau) < 1$. In all applications, $\lambda\exp(-\varepsilon/\tau)$ will satisfy this
inequality; otherwise the number of bosons in the system would not be bounded.

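The thermal average of the number of particles in the orbital is found from
the Gibbs sum by use of (5.62):

$$f(\varepsilon) = \lambda\frac{\partial}{\partial\lambda}\log\mathcal{Z} = -x\frac{d}{dx}\log(1 - x) = \frac{x}{1 - x} = \frac{1}{\lambda^{-1}\exp(\varepsilon/\tau) - 1}, \tag{9}$$

or

$$f(\varepsilon) = \frac{1}{\exp[(\varepsilon - \mu)/\tau] - 1}. \tag{10}$$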


Figure 6.6 Comparison of Bose-Einstein and Fermi-Dirac
distribution functions. The classical regime is attained for
$(\varepsilon - \mu) \gg \tau$, where the two distributions become nearly identical.
We shall see in Chapter 7 that in the degenerate regime at low
temperature the chemical potential $\mu$ for a FD distribution is
positive, and changes to negative at high temperature.


This defines the Bose-Einstein distribution function. It differs mathematically 
from the Fermi-Dirac distribution function only by having —1 instead of +1 
in the denominator. The change can have very significant physical consequences, 
as we shall see in Chapter 7. The two distribution functions are compared in 
Figure 6.6. The ideal gas represents the limit $\varepsilon - \mu \gg \tau$ in which the two
distribution functions are approximately equal, as discussed below. The choice of the
zero of the energy $\varepsilon$ is always arbitrary. The particular choice made in any
problem will affect the value of the chemical potential $\mu$, but the value of the
difference $\varepsilon - \mu$ has to be independent of the choice of the zero of $\varepsilon$. This point
is discussed further in (20) below.
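
As a numerical cross-check on the closed form for the boson occupancy, we can sum the Gibbs series term by term and compare. This is a sketch under stated assumptions: the function names, the truncation `n_max`, and the sample values of the energy, chemical potential and temperature are illustrative choices, not taken from the text.

```python
import math

def bose_einstein(eps, mu, tau):
    """Closed-form Bose-Einstein occupancy; requires eps > mu."""
    return 1.0 / (math.exp((eps - mu) / tau) - 1.0)

def occupancy_from_gibbs_sum(eps, mu, tau, n_max=200):
    """Average particle number from the Gibbs sum truncated at n_max particles."""
    x = math.exp((mu - eps) / tau)            # x = lambda * exp(-eps/tau) < 1
    terms = [x ** n for n in range(n_max + 1)]
    gibbs_sum = sum(terms)
    return sum(n * t for n, t in enumerate(terms)) / gibbs_sum

eps, mu, tau = 0.5, 0.0, 1.0                  # assumed illustrative values
print(bose_einstein(eps, mu, tau), occupancy_from_gibbs_sum(eps, mu, tau))
```

The truncation is harmless here because the terms fall off geometrically; as the chemical potential approaches the orbital energy from below, more terms are needed and the occupancy grows without bound.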

A gas is in the classical regime when the average number of atoms in each
orbital is much less than one. The average orbital occupancy for a gas at room
temperature and atmospheric pressure is of the order of only $10^{-6}$, safely in the
classical regime. Differences between fermions (half-integral spin) and bosons

Table 6.1 Comparison of the orbital occupancies in the classical
and the quantum regimes

Regime      Class of particle   Thermal average occupancy of any orbital
---------   -----------------   ----------------------------------------
Classical   Fermion             Always much less than one.
            Boson               Always much less than one.
Quantum     Fermion             Close to but less than one.
            Boson               Orbital of lowest energy has an occupancy
                                much greater than one.


arise only for occupancies of the order of one or more, so that in the classical 
regime their equilibrium properties are identical. The quantum regime is the 
opposite of the classical regime. These characteristic features are summarized 
in Table 6.1. 


CLASSICAL LIMIT 


An ideal gas is defined as a system of free noninteracting particles in the classical 
regime. “Free” means confined in a box with no restrictions or external forces 
acting within the box. We develop the properties of an ideal gas with the use 
of the powerful method of the Gibbs sum. In Chapter 3 we treated the ideal gas 
by use of the partition function, but the identical particle problem encountered 
there was resolved by a method whose validity was not perfectly clear. 

The Fermi-Dirac and Bose-Einstein distribution functions in the classical 
limit lead to the identical result for the average number of atoms in an orbital. 
Write $f(\varepsilon)$ for the average occupancy of an orbital at energy $\varepsilon$. Here $\varepsilon$ is the
energy of an orbital occupied by one particle; it is not the energy of a system of
$N$ particles. The Fermi-Dirac (FD) and Bose-Einstein (BE) distribution functions
are

$$f(\varepsilon) = \frac{1}{\exp[(\varepsilon - \mu)/\tau] \pm 1}, \tag{11}$$

where the plus sign is for the FD distribution and the minus sign for the BE
distribution. In order that $f(\varepsilon)$ be much smaller than unity for all orbitals, we
must have in this classical regime

$$\exp[(\varepsilon - \mu)/\tau] \gg 1, \tag{12}$$

for all $\varepsilon$. When this inequality is satisfied we may neglect the term $\pm 1$ in the
denominator of (11). Then for either fermions or bosons, the average occupancy
of an orbital of energy $\varepsilon$ is

$$f(\varepsilon) \simeq \exp[(\mu - \varepsilon)/\tau] = \lambda\exp(-\varepsilon/\tau), \tag{13}$$

with $\lambda = \exp(\mu/\tau)$. The limiting result (13) is called the classical distribution
function. It is the limit of the Fermi-Dirac and Bose-Einstein distribution
functions when the average occupancy $f(\varepsilon)$ is very small in comparison with
unity. Equation (13), although called classical, is still a result for particles
described by quantum mechanics: we shall find that the expression for $\lambda$ or $\mu$
always involves the quantum constant $\hbar$. Any theory which contains $\hbar$ cannot
be a classical theory.
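
To see how quickly the three distribution functions merge, we may tabulate them against the single variable $x = (\varepsilon - \mu)/\tau$. The sketch below is illustrative only; the function name `occupancies` and the sample values of $x$ are assumptions.

```python
import math

def occupancies(x):
    """Return (FD, BE, classical) occupancies for x = (eps - mu)/tau > 0."""
    e = math.exp(x)
    return 1.0 / (e + 1.0), 1.0 / (e - 1.0), 1.0 / e

for x in (1.0, 2.0, 5.0, 10.0, 14.0):
    fd, be, cl = occupancies(x)
    print(f"x = {x:5.1f}  FD = {fd:.3e}  BE = {be:.3e}  classical = {cl:.3e}")
```

The classical value always lies between the fermion and boson values, and the relative spread between the two quantum results is close to twice the classical occupancy itself, so it is negligible once the occupancy is of order $10^{-6}$.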

We use the classical distribution function $f(\varepsilon) = \lambda\exp(-\varepsilon/\tau)$ to study the
thermal properties of the ideal gas. There are many topics of importance: the 
entropy, chemical potential, heat capacity, the pressure-volume-temperature 
relation, and the distribution of atomic velocities. To obtain results from the 
classical distribution function, we need first to find the chemical potential in 
terms of the concentration of atoms. 


Chemical Potential 


The chemical potential is found from the condition that the thermal average 
of the total number of atoms equals the number of atoms known to be present. 
This number must be the sum over all orbitals of the distribution function $f(\varepsilon_s)$:

$$N = \langle N\rangle = \sum_s f(\varepsilon_s), \tag{14}$$

where $s$ is the index of an orbital of energy $\varepsilon_s$. We start with a monatomic gas
of $N$ identical atoms of zero spin, and later we include spin and molecular
modes of motion. The total number of atoms is the sum of the average number
of atoms in each orbital. We use (13) in (14) to obtain

$$N = \lambda\sum_s \exp(-\varepsilon_s/\tau). \tag{15}$$

To evaluate this sum, observe that the summation over free particle orbitals
is just the partition function $Z_1$ for a single free atom in volume $V$, whence
$N = \lambda Z_1$.

In Chapter 3 it was shown that $Z_1 = n_Q V$, where $n_Q = (M\tau/2\pi\hbar^2)^{3/2}$ is the
quantum concentration. Thus

$$N = \lambda Z_1 = \lambda n_Q V; \qquad \lambda = N/n_Q V = n/n_Q, \tag{16}$$

in terms of the number density $n = N/V$. Finally,

$$\lambda = \exp(\mu/\tau) = n/n_Q, \tag{17}$$

which is equal to the number of atoms in the quantum volume $1/n_Q$. In the
classical regime $n/n_Q$ is $\ll 1$. The chemical potential of the ideal monatomic
gas is

$$\mu = \tau\log(n/n_Q), \tag{18}$$

in agreement with (5.12a) obtained in another way. The result may be written
out to give

$$\mu = \tau[\log N - \log V - \tfrac{3}{2}\log\tau + \tfrac{3}{2}\log(2\pi\hbar^2/M)]. \tag{19}$$


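We see that the chemical potential increases as the concentration increases and
decreases as the temperature increases: in the classical regime $\log(n/n_Q)$ is
negative, and $n_Q$ grows with the temperature.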
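
A rough numerical estimate shows the orders of magnitude involved. The sketch below evaluates the quantum concentration and the chemical potential for helium gas; the choice of helium at 300 K and one atmosphere, the function names, and the use of SI constants are assumptions made for illustration, with the concentration taken from the ideal gas law.

```python
import math

HBAR = 1.054571817e-34     # J s
K_B = 1.380649e-23         # J/K
AMU = 1.66053906660e-27    # kg
EV = 1.602176634e-19       # J

def quantum_concentration(mass_kg, temp_K):
    """n_Q = (M tau / 2 pi hbar^2)^(3/2), with tau = k_B T, in m^-3."""
    tau = K_B * temp_K
    return (mass_kg * tau / (2.0 * math.pi * HBAR ** 2)) ** 1.5

def chemical_potential(n, mass_kg, temp_K):
    """mu = tau log(n/n_Q) for a monatomic ideal gas of spin zero, in joules."""
    return K_B * temp_K * math.log(n / quantum_concentration(mass_kg, temp_K))

# Assumed example: helium-4 at room temperature and atmospheric pressure.
T, p = 300.0, 101325.0
M_He = 4.002602 * AMU
n = p / (K_B * T)                          # concentration from p = n k_B T
ratio = n / quantum_concentration(M_He, T)
mu_eV = chemical_potential(n, M_He, T) / EV
print(f"n/n_Q = {ratio:.2e},  mu = {mu_eV:.3f} eV")
```

The ratio $n/n_Q$ comes out of the order of $10^{-6}$, so the gas is deep in the classical regime and the chemical potential is negative.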




Comment: The simple expression (18) for the chemical potential can be subject to several
modifications. We mention four examples.

(a) If the zero of the energy scale is shifted by an energy $\Delta$ so that the zero of the kinetic
energy of an orbital falls at $\varepsilon_0 = \Delta$ instead of at $\varepsilon_0 = 0$, then

$$\mu = \Delta + \tau\log(n/n_Q). \tag{20}$$

(b) If the atoms have spin $S$, the number of orbitals in the sum in (15) is multiplied by the
spin multiplicity $2S + 1$. For spin $\tfrac{1}{2}$ it is doubled; the value of the partition function $Z_1$
is doubled; $n_Q$ will be replaced everywhere by $2n_Q$, and the right-hand side of (18)
will have an added term $-\tau\log 2$. The effect of the spin on the entropy is treated
below.

(c) If the gas is not monatomic, the internal energy states associated with rotational and
vibrational motion will enter the partition function, and the chemical potential will
have an added term $-\tau\log Z_{int}$, per (48) below, where $Z_{int}$ is the partition function of
the internal degrees of freedom of one molecule.

(d) If the gas is nonideal, the result for $\mu$ may be considerably more complicated; see
Chapter 10 for the relatively simple van der Waals approximation to a gas of
interacting atoms.


Free Energy 


The chemical potential is related to the free energy by

$$(\partial F/\partial N)_{\tau,V} = \mu, \tag{21}$$

according to Chapter 5. From this,

$$F(N,\tau,V) = \int_0^N dN\,\mu(N,\tau,V) = \tau\int_0^N dN\,[\log N + \cdots], \tag{22}$$

where the integrand is found in brackets in (19). Now $\int dx\,\log x = x\log x - x$,
so that

$$F = N\tau[\log N - 1 - \log V - \tfrac{3}{2}\log\tau + \tfrac{3}{2}\log(2\pi\hbar^2/M)], \tag{23}$$

or

$$F = N\tau[\log(n/n_Q) - 1]. \tag{24}$$


The free energy increases with concentration and decreases with temperature.




Comment: The integral in (22) should strictly be a sum, because $N$ is a discrete variable.
Thus, from (5.6),

$$F(N,\tau,V) = \sum_{N=1}^{N} \mu(N), \tag{25}$$

which differs from the integral only in the term in $\log N$ in (19), for

$$\sum_{N=1}^{N} \log N = \log(1 \times 2 \times 3 \times \cdots \times N) = \log N!, \tag{26}$$

where the integral gave $N\log N - N$ in (23). But for large $N$ the Stirling approximation

$$\log N! \simeq N\log N - N \tag{27}$$

may be used, and now (25) is the same as (23).
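
The quality of the Stirling approximation is easily checked numerically. In the sketch below, `math.lgamma(n + 1)` from the Python standard library supplies $\log N!$; the helper names and the sample values of $N$ are assumptions made for illustration.

```python
import math

def log_factorial(n):
    """log N!, evaluated through the log-gamma function."""
    return math.lgamma(n + 1.0)

def stirling(n):
    """Leading Stirling form N log N - N."""
    return n * math.log(n) - n

def relative_error(n):
    exact = log_factorial(n)
    return (exact - stirling(n)) / exact

for n in (10, 1000, 10 ** 6):
    print(f"N = {n:>8d}   relative error = {relative_error(n):.2e}")
```

The relative error is more than ten percent for $N = 10$ but falls below one part in a million for $N = 10^6$, which is still tiny compared with the number of atoms in a macroscopic sample.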


Pressure 
The pressure is related to the free energy by (3.49):

$$p = -(\partial F/\partial V)_{\tau,N}. \tag{28}$$

With (23) for $F$ we have

$$p = N\tau/V; \qquad pV = N\tau, \tag{29}$$

which is the ideal gas law, as derived in Chapter 3.


Energy 
The thermal energy $U$ is found from $F = U - \tau\sigma$, or

$$U = F + \tau\sigma = F - \tau(\partial F/\partial\tau)_{V,N} = -\tau^2\left(\frac{\partial}{\partial\tau}\frac{F}{\tau}\right)_{V,N}. \tag{30}$$

With (23) for $F$ we have

$$\left(\frac{\partial}{\partial\tau}\frac{F}{\tau}\right)_{V,N} = -\frac{3N}{2\tau}, \tag{31}$$

so that for an ideal gas

$$U = \tfrac{3}{2}N\tau. \tag{32}$$

The factor $\tfrac{3}{2}$ arises from the exponent of $\tau$ in $n_Q$ because the gas is in three
dimensions; if $n_Q$ were in one or two dimensions, the factor would be $\tfrac{1}{2}$ or 1,
respectively. The average kinetic energy of translational motion in the classical
limit is equal to $\tfrac{1}{2}\tau$ or $\tfrac{1}{2}k_BT$ per translational degree of freedom of an atom.
The principle of equipartition of energy among degrees of freedom was discussed
in Chapter 3.
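
The chain from the free energy to the pressure, entropy and energy can be verified by numerical differentiation. This is a sketch only: it assumes fundamental units with $M = \hbar = 1$, uses central finite differences with hand-picked step sizes, and the function names and the sample values of $N$, $\tau$ and $V$ are illustrative assumptions.

```python
import math

def free_energy(N, tau, V):
    """F = N tau [log(n/n_Q) - 1], in assumed units with M = hbar = 1."""
    n_Q = (tau / (2.0 * math.pi)) ** 1.5
    return N * tau * (math.log(N / (V * n_Q)) - 1.0)

def derivative(func, x, h):
    """Central finite-difference estimate of d(func)/dx."""
    return (func(x + h) - func(x - h)) / (2.0 * h)

N, tau, V = 1000.0, 2.0, 1.0e6                 # dilute gas: n/n_Q is about 0.006
p = -derivative(lambda v: free_energy(N, tau, v), V, 100.0)
sigma = -derivative(lambda t: free_energy(N, t, V), tau, 1.0e-5)
U = free_energy(N, tau, V) + tau * sigma

print(f"pV/(N tau) = {p * V / (N * tau):.6f}")
print(f"U/(N tau)  = {U / (N * tau):.6f}")
```

Within the accuracy of the finite differences the first ratio is 1 and the second is 3/2, in agreement with the ideal gas law and with the result for the energy.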


A polyatomic molecule has rotational degrees of freedom, and the average
energy of each rotational degree of freedom is $\tfrac{1}{2}\tau$ when the temperature is high
in comparison with the energy differences between the rotational energy levels 
of the molecule. The rotational energy is kinetic. A linear molecule has two 
degrees of rotational freedom which can be excited; a nonlinear molecule has 
three degrees of rotational freedom. 


Entropy 
The entropy is related to the free energy by

$$\sigma = -(\partial F/\partial\tau)_{V,N}. \tag{33}$$

From (23) for $F$ we have the entropy of an ideal gas:

$$\sigma = N[\log(n_Q/n) + \tfrac{5}{2}]. \tag{34}$$

This is identical with our earlier result (3.76). In the classical regime $n/n_Q$ is
$\ll 1$, so that $\log(n_Q/n)$ is positive. The result (34) is known as the Sackur-Tetrode
equation for the absolute entropy of a monatomic ideal gas. It is important 
historically and is essential in the thermodynamics of chemical reactions. Even 
though the equation contains h, the result was inferred from experiments on 
vapor pressure and on equilibrium in chemical reactions long before the 
quantum-mechanical basis was fully understood. It was a great challenge to 
theoretical physicists to explain the Sackur-Tetrode equation, and many un- 
successful attempts to do so were made in the early years of this century. We 
shall encounter applications of the result in later chapters. 

The entropy of the ideal gas is directly proportional to the number of particles 
N if their concentration n is constant, as we see from (34). When two identical 
gases at identical conditions are placed side by side, each system having entropy
$\sigma_1$, the total entropy is $2\sigma_1$ because $N$ is doubled. If a valve that connects the
systems is opened, the entropy is unchanged. We see that the entropy scales as 
the size of the system: the entropy is linear in the number of particles, at constant 
concentration. If the gases are not identical, the entropy increases when the 
valve is opened (Problem 6). 


Heat Capacity 


The heat capacity at constant volume is defined in Chapter 3 as

$$C_V = \tau(\partial\sigma/\partial\tau)_V. \tag{35}$$

We can calculate the derivative directly from the entropy (34) of an ideal gas
when the expression for $n_Q$ is written out:

$$C_V = \tfrac{3}{2}N, \tag{36}$$

or $C_V = \tfrac{3}{2}Nk_B$ in conventional units.

The heat capacity at constant pressure is larger than $C_V$ because additional
heat must be added to perform the work needed to expand the volume of the
gas against the constant pressure $p$, as discussed in detail in Chapter 8. We use
the thermodynamic identity $\tau\,d\sigma = dU + p\,dV$ to obtain

$$C_p = \tau\left(\frac{\partial\sigma}{\partial\tau}\right)_p = \left(\frac{\partial U}{\partial\tau}\right)_p + p\left(\frac{\partial V}{\partial\tau}\right)_p. \tag{37}$$

The energy of an ideal gas depends only on the temperature, so that $(\partial U/\partial\tau)_p$
will have the same value as $(\partial U/\partial\tau)_V$, which is just $C_V$ by the argument of (3.17b).
By the ideal gas law $V = N\tau/p$, so that the term $p(\partial V/\partial\tau)_p = N$. Thus (37)
becomes

$$C_p = C_V + N \tag{38a}$$

in fundamental units, or

$$C_p = C_V + Nk_B \tag{38b}$$


in conventional units. We notice again the different dimensions that heat 
capacities have in the two systems of units. For one mole, $Nk_B$ is usually written
as $R$, called the gas constant.

The results (38a,b) are written for an ideal gas without spin or other internal
degrees of freedom of a molecule. For an atom $C_V = \tfrac{3}{2}N$, so that

$$C_p = \tfrac{3}{2}N + N = \tfrac{5}{2}N \tag{38c}$$

in fundamental units, or

$$C_p = \tfrac{5}{2}Nk_B \tag{38d}$$

in conventional units. The ratio $C_p/C_V$ is written as $\gamma$, the Greek letter gamma.
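
Both heat capacities can be recovered numerically from the entropy of the monatomic gas by differentiating once at fixed volume and once at fixed pressure. The sketch below assumes fundamental units with $M = \hbar = 1$; the function names, the finite-difference step and the sample state are illustrative assumptions.

```python
import math

def entropy(N, tau, V):
    """Sackur-Tetrode entropy, in assumed units with M = hbar = 1."""
    n_Q = (tau / (2.0 * math.pi)) ** 1.5
    return N * (math.log(n_Q * V / N) + 2.5)

def heat_capacities(N, tau, V, h=1.0e-4):
    """Return (C_V, C_p) from central differences of the entropy."""
    p = N * tau / V                               # ideal gas law fixes the pressure
    c_v = tau * (entropy(N, tau + h, V) - entropy(N, tau - h, V)) / (2.0 * h)
    v_plus, v_minus = N * (tau + h) / p, N * (tau - h) / p   # volumes at fixed p
    c_p = tau * (entropy(N, tau + h, v_plus) - entropy(N, tau - h, v_minus)) / (2.0 * h)
    return c_v, c_p

N = 1000.0
c_v, c_p = heat_capacities(N, tau=2.0, V=1.0e6)
print(f"C_V/N = {c_v / N:.5f},  C_p/N = {c_p / N:.5f},  gamma = {c_p / c_v:.5f}")
```

For the monatomic gas the ratio $\gamma$ comes out as 5/3 to within the accuracy of the differences.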


Example: Experimental tests of the Sackur-Tetrode equation. Experimental values of the
entropy are often found from experimental values of $C_p$ by numerical integration of (37)
to give at constant pressure

$$\sigma(\tau) - \sigma(0) = \int_0^{\tau} (C_p/\tau)\,d\tau. \tag{39}$$

Here $\sigma(0)$ denotes the entropy at the lowest temperature attained in the measurements
of $C_p$. The third law of thermodynamics suggests that $\sigma(0)$ may be set equal to zero unless
there are multiplicities not removed at the lowest temperature attained.

We can calculate the entropy of a monatomic ideal gas by use of the Sackur-Tetrode 
equation (34). The value thus calculated at a selected temperature and pressure may be 
compared with the experimental value of the entropy of the gas. The experimental value 
is found by summing the following contributions: 


1. Entropy increase on heating solid from absolute zero to the melting point.
2. Entropy increase in the solid-to-liquid transformation (discussed in Chapter 10).
3. Entropy increase on heating liquid from melting point to the boiling point.
4. Entropy increase in the liquid-to-gas transformation.
5. Entropy change on heating gas from the boiling point to the selected temperature and
pressure.


There may further be a slight correction to (34) for the nonideality of the gas. Comparisons 
of experimental and theoretical values have now been carried out for many gases, and very 
satisfactory agreement is found between the two sets of values.* 

We give details of the comparison for neon, after the measurements of Clusius. The
entropy is given in terms of the conventional entropy $S = k_B\sigma$.

1. The heat capacity of the solid was measured from 12.3 K to the melting point 24.55 K
under one atmosphere of pressure. The heat capacity of the solid below 12.3 K was
estimated by a Debye law (Chapter 4) extrapolation to absolute zero of the
measurements above 12.3 K. The entropy of the solid at the melting point is found by numerical
integration of $\int dT\,(C_p/T)$ to be

$$S_{solid} = 14.29\ \mathrm{J\,mol^{-1}\,K^{-1}}.$$


* A classic study is “The heat capacity of oxygen from 12 K to its boiling point and its heat of
vaporization. The entropy from spectroscopic data,” W. F. Giauque and H. L. Johnston, Journal of
the American Chemical Society 51, 2300 (1929).

Table 6.2 Comparison of experimental and calculated values of the entropy
at the boiling point under one atmosphere

                       Entropy in J mol^-1 K^-1
Gas   T_b.p., in K     Experimental   Calculated
---   ------------     ------------   ----------
Ne        27.2             96.40         96.45
Ar        87.29           129.75        129.24
Kr       119.93           144.56        145.06

SOURCE: From Landolt-Börnstein tables, 6th ed., Vol. 2,
Part 4, pp. 394-399.


2. The heat input required to melt the solid at 24.55 K is observed to be 335 J mol$^{-1}$. The
associated entropy of melting is

$$\Delta S_{melting} = \frac{335\ \mathrm{J\,mol^{-1}}}{24.55\ \mathrm{K}} = 13.64\ \mathrm{J\,mol^{-1}\,K^{-1}}.$$

3. The heat capacity of the liquid was measured from the melting point to the boiling
point of 27.2 K under one atmosphere of pressure. The entropy increase was found to be

$$\Delta S_{liquid} = 3.85\ \mathrm{J\,mol^{-1}\,K^{-1}}.$$

4. The heat input required to vaporize the liquid at 27.2 K was observed to be 1761 J
mol$^{-1}$. The associated entropy of vaporization is

$$\Delta S_{vaporization} = \frac{1761\ \mathrm{J\,mol^{-1}}}{27.2\ \mathrm{K}} = 64.62\ \mathrm{J\,mol^{-1}\,K^{-1}}.$$

The experimental value of the entropy of neon gas at 27.2 K at a pressure of one
atmosphere adds up to

$$S_{gas} = S_{solid} + \Delta S_{melting} + \Delta S_{liquid} + \Delta S_{vaporization} = 96.40\ \mathrm{J\,mol^{-1}\,K^{-1}}.$$

The calculated value of the entropy of neon under the same conditions is

$$S_{gas} = 96.45\ \mathrm{J\,mol^{-1}\,K^{-1}},$$


from the Sackur-Tetrode equation. The excellent agreement with the experimental value 
gives us confidence in the basis of the entire theoretical apparatus that led to the Sackur- 
Tetrode equation. The result (34) could hardly have been guessed; to find it verified by 
observation is a real experience. Results for argon and krypton are given in Table 6.2. 
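
The neon comparison can be reproduced in a few lines. The sketch below evaluates the Sackur-Tetrode entropy per mole at the boiling point and sums the four measured contributions quoted above. The physical constants, the atomic mass of neon (20.1797 u), the use of the ideal gas law for the concentration, and the function name are assumptions not stated in the text.

```python
import math

HBAR = 1.054571817e-34     # J s
K_B = 1.380649e-23         # J/K
N_A = 6.02214076e23        # 1/mol
AMU = 1.66053906660e-27    # kg

def sackur_tetrode_molar(mass_amu, temp_K, pressure_Pa):
    """Molar entropy S = R [log(n_Q/n) + 5/2] of a monatomic ideal gas, J/(mol K)."""
    tau = K_B * temp_K
    n = pressure_Pa / tau                                  # ideal gas law
    n_Q = (mass_amu * AMU * tau / (2.0 * math.pi * HBAR ** 2)) ** 1.5
    return N_A * K_B * (math.log(n_Q / n) + 2.5)

# Measured contributions for neon quoted in the text, in J/(mol K):
# solid, melting, liquid, vaporization.
measured_parts = [14.29, 13.64, 3.85, 64.62]
S_exp = sum(measured_parts)
S_calc = sackur_tetrode_molar(20.1797, 27.2, 101325.0)
print(f"experimental = {S_exp:.2f},  calculated = {S_calc:.2f}  J/(mol K)")
```

With these inputs the calculated value lands close to the 96.45 J mol$^{-1}$ K$^{-1}$ quoted above, and the measured contributions add up to 96.40 J mol$^{-1}$ K$^{-1}$.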




Chemical Potential of Ideal Gas with 
Internal Degrees of Freedom 


We consider now an ideal gas of identical polyatomic molecules. Each molecule 
has rotational and vibrational degrees of freedom in addition to the transla- 
tional degrees of freedom. The total energy ¢ of the molecule is the sum of two 
independent parts, 


& = & + Ein (40) 


where ¢j,, refers to the rotational and vibrational degrees of freedom and «, 
to the translational motion of the center of mass of the molecule. The vibrational 
energy problem is the harmonic oscillator problem treated earlier. The rota- 
tional energy was the subject of Problem 3.6. 

In the classical regime the Gibbs sum for the orbital n is

$$ \mathcal{Z} = 1 + \lambda\exp(-\epsilon_n/\tau) , \qquad (41) $$

where terms in higher powers of $\lambda$ are omitted because the average
occupancy of the orbital n is assumed to be $\ll 1$. That is, we neglect the terms
in $\mathcal{Z}$ which correspond to occupancies greater than unity. In the
presence of internal energy states the Gibbs sum associated with the orbital n
becomes

$$ \mathcal{Z} = 1 + \lambda\sum_{\rm int}\exp[-(\epsilon_n + \epsilon_{\rm int})/\tau] , \qquad (42) $$

or

$$ \mathcal{Z} = 1 + \lambda\exp(-\epsilon_n/\tau)\sum_{\rm int}\exp(-\epsilon_{\rm int}/\tau) . \qquad (43) $$


The summation is just the partition function of the internal states:

$$ Z_{\rm int} = \sum_{\rm int}\exp(-\epsilon_{\rm int}/\tau) , \qquad (44) $$

which is related to the internal free energy of the one molecule by
$F_{\rm int} = -\tau\log Z_{\rm int}$. From (43) the Gibbs sum is

$$ \mathcal{Z} = 1 + \lambda Z_{\rm int}\exp(-\epsilon_n/\tau) . \qquad (45) $$

The probability that the translational orbital n is occupied, irrespective of
the state of internal motion of the molecule, is given by the ratio of the term
in $\lambda$ to the Gibbs sum $\mathcal{Z}$:

$$ f(\epsilon_n) = \frac{\lambda Z_{\rm int}\exp(-\epsilon_n/\tau)}{1 + \lambda Z_{\rm int}\exp(-\epsilon_n/\tau)} \simeq \lambda Z_{\rm int}\exp(-\epsilon_n/\tau) . \qquad (46) $$


The classical regime was defined earlier as $f(\epsilon_n) \ll 1$. The result (46) is
entirely analogous to (13) for the monatomic case, but $\lambda Z_{\rm int}$ now
plays the role of $\lambda$.
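As an illustrative check of (42) to (46), the sketch below sums the Gibbs sum directly over a made-up set of internal levels and compares it with the factorized form $1 + \lambda Z_{\rm int}\exp(-\epsilon_n/\tau)$. The level energies, $\lambda$, and $\tau$ are arbitrary values chosen for the example, and the function names are hypothetical.

```python
import math


def gibbs_sum_direct(lam, eps_n, internal_levels, tau):
    """Eq. (42): sum the Boltzmann factor over every internal state explicitly."""
    return 1.0 + lam * sum(math.exp(-(eps_n + e_int) / tau) for e_int in internal_levels)


def internal_partition_function(internal_levels, tau):
    """Eq. (44): partition function of the internal states alone."""
    return sum(math.exp(-e_int / tau) for e_int in internal_levels)


def occupancy(lam, eps_n, z_int, tau):
    """Eq. (46): exact occupancy and its classical approximation."""
    x = lam * z_int * math.exp(-eps_n / tau)
    return x / (1.0 + x), x


# Hypothetical numbers, with all energies measured in units of tau = 1.
tau = 1.0
lam = 1.0e-4                    # lambda << 1 puts the orbital in the classical regime
eps_n = 0.7                     # energy of one translational orbital
levels = [0.0, 0.5, 0.5, 1.3]   # made-up internal energies (one pair is degenerate)

z_int = internal_partition_function(levels, tau)
direct = gibbs_sum_direct(lam, eps_n, levels, tau)
factorized = 1.0 + lam * z_int * math.exp(-eps_n / tau)   # Eq. (45)
f_exact, f_classical = occupancy(lam, eps_n, z_int, tau)

print(direct, factorized)
print(f_exact, f_classical)
```

The two Gibbs sums agree to rounding error because the translational and internal energies add, so the Boltzmann factor is a product. The relative difference between the exact and classical occupancies is of the order of the occupancy itself.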

Several of the results derived for the monatomic ideal gas are different for
the polyatomic ideal gas:

(a) Equation (17) for $\lambda$ is replaced by

$$ \lambda = n/(n_Q Z_{\rm int}) , \qquad (47) $$

with $n_Q$ defined exactly as before. (We shall always use $n_Q$ as defined for the
monatomic ideal gas of atoms with zero spin.) Because $\lambda = \exp(\mu/\tau)$ we
have

$$ \mu = \tau[\log(n/n_Q) - \log Z_{\rm int}] . \qquad (48) $$

(b) The free energy is increased by, for N molecules,

$$ F_{\rm int} = -N\tau\log Z_{\rm int} . \qquad (49) $$

(c) The entropy is increased by

$$ \sigma_{\rm int} = -(\partial F_{\rm int}/\partial\tau)_V . \qquad (50) $$

The former result $U = \frac{3}{2}N\tau$ applies to the translational energy alone.


Example: Spin entropy in zero magnetic field. Consider an atom of spin I, where I may
represent both electronic and nuclear spins. The internal partition function associated
with the spin alone is

$$ Z_{\rm int} = (2I + 1) , \qquad (51) $$

this being the number of independent spin states. The spin contribution to the free energy is

$$ F_{\rm int} = -\tau\log(2I + 1) , \qquad (52) $$

and the spin entropy is

$$ \sigma_{\rm int} = \log(2I + 1) , \qquad (53) $$

by (50). The effect of the spin entropy on the chemical potential is found with the help of (48):

$$ \mu = \tau[\log(n/n_Q) - \log(2I + 1)] . \qquad (54) $$
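The step from (52) to (53) can be checked numerically. The sketch below, written in units where the fundamental temperature is a plain number, differentiates the spin free energy by a finite difference as prescribed by (50) and compares the result with $\log(2I + 1)$. The function names and the step size are assumptions of this example.

```python
import math


def f_int(tau, spin):
    """Eq. (52): spin free energy of one atom, F_int = -tau log(2I + 1)."""
    return -tau * math.log(2.0 * spin + 1.0)


def sigma_int(tau, spin, step=1.0e-6):
    """Eq. (50): sigma_int = -dF_int/dtau, by a central finite difference."""
    return -(f_int(tau + step, spin) - f_int(tau - step, spin)) / (2.0 * step)


def mu_shift(tau, spin):
    """Spin term in Eq. (54): the amount added to tau log(n/n_Q)."""
    return -tau * math.log(2.0 * spin + 1.0)


for spin in (0.5, 1.0, 1.5):
    print(spin, sigma_int(1.0, spin), math.log(2.0 * spin + 1.0), mu_shift(1.0, spin))
```

Because the spin states are degenerate in zero field, the spin entropy is independent of temperature, while the shift of the chemical potential is negative and proportional to $\tau$.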


Reversible Isothermal Expansion 


Consider as a model example $1 \times 10^{22}$ atoms of $^4$He at an initial volume of
$10^3\ \mathrm{cm^3}$ at 300 K. Let the gas expand slowly at constant temperature until the
volume is $2 \times 10^3\ \mathrm{cm^3}$. The temperature is maintained constant by thermal
contact with a large reservoir. In a reversible expansion the system at any
instant is in its most probable configuration.


What is the pressure after expansion? 

The final volume is twice the initial volume; the final temperature is equal
to the initial temperature. From $pV = N\tau$ we see that the final pressure is
one-half the initial pressure.


What is the increase of entropy on expansion? 
The entropy of an ideal gas at constant temperature depends on volume as

$$ \sigma(V) = N\log V + \text{constant} , \qquad (55) $$

whence

$$ \sigma_2 - \sigma_1 = N\log(V_2/V_1) = N\log 2 = (1 \times 10^{22})(0.693) = 0.069 \times 10^{23} . \qquad (56) $$

Notice that the entropy is larger at the larger volume, because the system has
more accessible states in the larger volume than in the smaller volume at the
same temperature.

Figure 6.7 Work is done by the gas in an isothermal
expansion. Here the gas does work by raising the weights.
Under isothermal conditions pV is constant for an ideal gas,
so that the pressure must be reduced to allow the volume to
expand. The pressure is reduced by removing the load of
weights a little at a time. (Labels in the figure: direction of
motion of piston in expansion; weights; heat flow.)


How much work is done by the gas in the expansion? 
When the gas expands isothermally, it does work against a piston, as in 
Figure 6.7. The work done on the piston when the volume is doubled is 


$$ \int_{V_1}^{V_2} p\,dV = \int_{V_1}^{V_2} (N\tau/V)\,dV = N\tau\log(V_2/V_1) = N\tau\log 2 . \qquad (57) $$

We evaluate $N\tau$ directly as $4.14 \times 10^8$ erg = 41.4 J. Thus the work done on the
piston is, from (57),

$$ N\tau\log 2 = (41.4\ \mathrm{J})(0.693) = 28.7\ \mathrm{J} . \qquad (58) $$

The assumption that the process is reversible enters in (57) when we assume
that a knowledge of V at every stage determines p at every stage of the expansion.

We define W as the work done on the gas by external agencies. This is the
negative of the work done by the gas on the piston. From (58),

$$ W = -\int p\,dV = -28.7\ \mathrm{J} . \qquad (59) $$




What is the change of energy in the expansion? 

The energy of an ideal monatomic gas is $U = \frac{3}{2}N\tau$ and does not change in
an expansion at constant temperature. However, the Helmholtz free energy
decreases by $N\tau\log 2$, which is the work done. The connection is discussed in
Chapter 8.


How much heat flowed into the gas from the reservoir?

We have seen that the energy of the ideal gas remained constant when the
gas did work on the piston. By conservation of energy it is necessary that a
flow of energy in the form of heat into the gas occur from the reservoir through
the walls of the container. The quantity Q of heat added to the gas must be
equal in magnitude, but opposite in sign, to the work W done on the gas by the
piston, because Q + W = 0. Thus

$$ Q = 28.7\ \mathrm{J} , \qquad (60) $$

from the result (59).
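The numbers in (56) through (60) follow from a few lines of arithmetic, sketched below in SI units. The function name and the CODATA value of the Boltzmann constant are assumptions of this example; only the ratio of the two volumes enters.

```python
import math

K_B = 1.380649e-23   # Boltzmann constant, J/K (CODATA value, assumed here)


def isothermal_expansion(n_atoms, temperature_k, v1, v2):
    """Reversible isothermal expansion of an ideal gas from v1 to v2.

    Returns (entropy change in fundamental units,
             work done ON the gas in joules,
             heat added to the gas in joules).
    """
    n_tau = n_atoms * K_B * temperature_k   # N*tau in joules
    log_ratio = math.log(v2 / v1)
    delta_sigma = n_atoms * log_ratio       # Eq. (56)
    work_on_gas = -n_tau * log_ratio        # Eq. (59)
    heat_in = -work_on_gas                  # Q + W = 0 because U is constant
    return delta_sigma, work_on_gas, heat_in


d_sigma, w, q = isothermal_expansion(1.0e22, 300.0, 1.0e3, 2.0e3)
print(f"delta sigma = {d_sigma:.3e}, W = {w:.1f} J, Q = {q:.1f} J")
```

In conventional units the entropy change is $k_B$ times the first returned value, and it also equals $Q/T$ for this reversible process.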


Reversible Expansion at Constant Entropy 


We considered above an expansion at constant temperature. Suppose instead
that the gas expands reversibly from $1 \times 10^3\ \mathrm{cm^3}$ to $2 \times 10^3\ \mathrm{cm^3}$ in an
insulated container. No heat flow to or from the gas is permitted, so that Q = 0.
The entropy is constant in a system isolated from the reservoir if the expansion 
process is carried out reversibly (slowly). A process without a change of entropy 
is called an isentropic process or an adiabatic process. The term “adiabatic” 
has the specific meaning that there is no heat transfer in the process. For 
simplicity, we shall stick with “isentropic.” 


What is the temperature of the gas after expansion? 
The entropy of an ideal monatomic gas depends on the volume and the 
temperature as 


$$ \sigma(\tau, V) = N(\log\tau^{3/2} + \log V + \text{constant}) , \qquad (61) $$

so that the entropy remains constant if

$$ \log\tau^{3/2}V = \text{constant} ; \qquad \tau^{3/2}V = \text{constant} . \qquad (62) $$

In an expansion at constant entropy from $V_1$ to $V_2$ we have

$$ \tau_1^{3/2}V_1 = \tau_2^{3/2}V_2 \qquad (63) $$

for an ideal monatomic gas.

We use the ideal gas law $pV = N\tau$ to obtain two alternate forms. We insert
$V = N\tau/p$ into (63) and cancel N on both sides to obtain

$$ \frac{\tau_1^{5/2}}{p_1} = \frac{\tau_2^{5/2}}{p_2} . \qquad (64) $$

Similarly, we insert $\tau = pV/N$ in (63) to obtain

$$ p_1^{3/2}V_1^{5/2} = p_2^{3/2}V_2^{5/2} , \quad \text{or} \quad p_1 V_1^{5/3} = p_2 V_2^{5/3} . \qquad (65) $$

Both (64) and (65) hold only for a monatomic gas.

It is the subject of Problem 10 to generalize these results for an ideal gas of
molecules with internal degrees of motion (rotations, vibrations). We obtain
for an isentropic process

$$ \tau_1 V_1^{\gamma - 1} = \tau_2 V_2^{\gamma - 1} ; \qquad (66) $$

$$ \tau_1^{\gamma/(1-\gamma)} p_1 = \tau_2^{\gamma/(1-\gamma)} p_2 ; \qquad (67) $$

$$ p_1 V_1^{\gamma} = p_2 V_2^{\gamma} . \qquad (68) $$

Here $\gamma = C_p/C_V$ is the ratio of the heat capacities at constant pressure and
constant volume.

With $T_1 = 300$ K and $V_1/V_2 = \frac{1}{2}$ we find from (63):

$$ T_2 = (\tfrac{1}{2})^{2/3}(300\ \mathrm{K}) = 189\ \mathrm{K} . \qquad (69) $$

This is the final temperature after the expansion at constant entropy. The gas
is cooled in the expansion process by

$$ T_1 - T_2 = 300\ \mathrm{K} - 189\ \mathrm{K} = 111\ \mathrm{K} . \qquad (70) $$

Expansion at constant entropy is an important method of refrigeration.


What is the change in energy in the expansion? 
The energy change is calculated from the temperature change (70). For an 
ideal monatomic gas 


$$ U_2 - U_1 = C_V(\tau_2 - \tau_1) = \tfrac{3}{2}N(\tau_2 - \tau_1) , \qquad (71) $$

or, in conventional units,

$$ U_2 - U_1 = \tfrac{3}{2}Nk_B(T_2 - T_1) = \tfrac{3}{2}(1 \times 10^{22})(1.38 \times 10^{-16}\ \mathrm{erg\,K^{-1}})(-111\ \mathrm{K}) = -2.3 \times 10^{8}\ \mathrm{erg} = -23\ \mathrm{J} . \qquad (72) $$

The energy decreases in an expansion at constant entropy. The work done by
the gas is equal to the decrease in energy of the gas, which is $U_1 - U_2 = 23$ J.
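The isentropic numbers in (69), (70), and (72) can be reproduced with the sketch below, which applies $\tau^{3/2}V = \text{constant}$ and $U = \frac{3}{2}N\tau$ to the same sample of gas. The function name and the CODATA value of $k_B$ are assumptions of this example.

```python
K_B = 1.380649e-23   # Boltzmann constant, J/K (CODATA value, assumed here)


def isentropic_expansion_monatomic(n_atoms, t1_k, v1, v2):
    """Final temperature (K) and energy change (J) of a monatomic ideal gas
    expanded reversibly at constant entropy from v1 to v2.

    Uses tau^(3/2) V = constant, Eq. (63), and U = (3/2) N tau.
    """
    t2_k = t1_k * (v1 / v2) ** (2.0 / 3.0)
    delta_u = 1.5 * n_atoms * K_B * (t2_k - t1_k)   # Eq. (72), in joules
    return t2_k, delta_u


t2, du = isentropic_expansion_monatomic(1.0e22, 300.0, 1.0e3, 2.0e3)
work_by_gas = -du   # Q = 0, so the gas pays for the work out of its own energy
print(f"T2 = {t2:.0f} K, U2 - U1 = {du:.1f} J, work done by gas = {work_by_gas:.1f} J")
```

Compared with the isothermal case, less work is done by the gas (about 23 J instead of 28.7 J) because the pressure falls faster when no heat flows in.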


Sudden Expansion into a Vacuum 


Let the gas expand suddenly into a vacuum from an initial volume of 1 liter to 
a final volume of 2 liters. This is an excellent example of an irreversible process. 
When a hole is opened in the partition to permit the expansion, the first atoms 
rush through the hole and strike the opposite wall. If no heat flow through the 
walls is permitted, there is no way for the atoms to lose their kinetic energy. 
The subsequent flow may be turbulent (irreversible), with different parts of the 
gas at different values of the energy density. Irreversible energy flow between 
regions will eventually equalize conditions throughout the gas. We assume the 
whole process occurs rapidly enough so that no heat flows in through the walls. 


How much work is done in the expansion? 

No means of doing external work is provided, so that the work done is zero. 
Zero work is not necessarily a characteristic of all irreversible processes, but 
the work is zero for expansion into a vacuum. 


What is the temperature after expansion? 

No work is done and no heat is added in the expansion: W = 0, Q = 0, and
$U_2 - U_1 = 0$. Because the energy is unchanged, the temperature of the ideal
gas is unchanged. The energy of a real gas may change in the process because 
the atoms are moved farther apart, which affects their interaction energy. 


What is the change of entropy in the expansion? 
The increase of entropy when the volume is doubled at constant temperature 
is given by (56): 


$$ \Delta\sigma = \sigma_2 - \sigma_1 = N\log 2 = 0.069 \times 10^{23} . \qquad (73) $$

For the expansion into a vacuum Q = 0.
Expansion into a vacuum is not a reversible process: the system is not in the
most probable (equilibrium) configuration at every stage of the expansion. Only
the initial configuration before removal of the partition and the final
configuration after equilibration are most probable configurations. At intermediate
stages the distribution in concentration and kinetic energy of atoms between
the two regions into which the system is divided does not correspond to an
equilibrium distribution. The central results of these calculations are
summarized in Table 6.3.

Table 6.3 Summary of ideal monatomic gas expansion experiments

Process                               $U_2 - U_1$                                    $\sigma_2 - \sigma_1$    $W$                                            $Q$
Reversible isothermal expansion       $0$                                            $N\log(V_2/V_1)$         $-N\tau\log(V_2/V_1)$                          $N\tau\log(V_2/V_1)$
Reversible isentropic expansion       $-\frac{3}{2}N\tau_1[1 - (V_1/V_2)^{2/3}]$     $0$                      $-\frac{3}{2}N\tau_1[1 - (V_1/V_2)^{2/3}]$     $0$
Irreversible expansion into vacuum    $0$                                            $N\log(V_2/V_1)$         $0$                                            $0$


SUMMARY: STEPS LEADING TO THE IDEAL 
GAS LAW FOR SPINLESS MONATOMIC GAS 


(a) $f(\epsilon) = \lambda\exp(-\epsilon/\tau)$
    Occupancy of an orbital in the classical limit of $f(\epsilon) \ll 1$.

(b) $\lambda = N \big/ \sum_n \exp(-\epsilon_n/\tau)$
    Given N, this equation determines $\lambda$ in the classical limit.

(c) $\epsilon_n = \frac{\hbar^2}{2M}\left(\frac{\pi n}{L}\right)^2$
    Energy of a free particle orbital of quantum number n in a cube of volume V.

(d) $\sum_n \exp(-\epsilon_n/\tau) = \frac{1}{2}\pi\int_0^\infty dn\, n^2\exp(-\epsilon_n/\tau)$
    Transformation of the sum to an integral.

(e) $\lambda = N/n_Q V$
    Result of the integration (d) after substitution in (b).

(f) $n_Q = (M\tau/2\pi\hbar^2)^{3/2}$
    Definition of the quantum concentration.

(g) $\mu = \tau\log(n/n_Q)$

(h) $F = \int_0^N dN\,\mu(N, \tau, V) = N\tau[\log(n/n_Q) - 1]$

(i) $p = -(\partial F/\partial V)_{\tau,N} = N\tau/V$
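Step (d) can be tested numerically. The sketch below sums $\exp(-\epsilon_n/\tau)$ directly over the positive integer triplets and compares the result with the integral, which evaluates to $(\pi/4a)^{3/2} = n_Q V$ when the dimensionless ratio $a = \hbar^2\pi^2/2ML^2\tau$ is introduced. The symbol $a$, its value, and the function names are assumptions made for this example.

```python
import math


def orbital_sum(a, n_max):
    """Direct sum over positive integers (nx, ny, nz) of exp(-a (nx^2 + ny^2 + nz^2)).

    The triple sum factorizes into the cube of a one-dimensional sum.
    Here a = hbar^2 pi^2 / (2 M L^2 tau) is dimensionless.
    """
    one_dim = sum(math.exp(-a * n * n) for n in range(1, n_max + 1))
    return one_dim ** 3


def integral_result(a):
    """Step (d): (pi/2) * integral_0^inf n^2 exp(-a n^2) dn = (pi / 4a)^(3/2)."""
    return (math.pi / (4.0 * a)) ** 1.5


a = 1.0e-4   # small a: the level spacing is much smaller than tau
exact = orbital_sum(a, n_max=2000)   # terms beyond n = 2000 are negligible here
approx = integral_result(a)
print(exact, approx, exact / approx)
```

For this value of $a$ the integral overestimates the sum by a little under two percent; the discrepancy shrinks as $a$ decreases, that is, for larger boxes or higher temperatures.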


PROBLEMS 


1. Derivative of Fermi-Dirac function. Show that $-\partial f/\partial\epsilon$ evaluated at the
Fermi level $\epsilon = \mu$ has the value $(4\tau)^{-1}$. Thus the lower the temperature, the
steeper the slope of the Fermi-Dirac function.


2. Symmetry of filled and vacant orbitals. Let $\epsilon = \mu + \delta$, so that $f(\epsilon)$
appears as $f(\mu + \delta)$. Show that

$$ f(\mu + \delta) = 1 - f(\mu - \delta) . \qquad (74) $$

Thus the probability that an orbital $\delta$ above the Fermi level is occupied is equal
to the probability an orbital $\delta$ below the Fermi level is vacant. A vacant orbital is
sometimes known as a hole.


3. Distribution function for double occupancy statistics. Let us imagine a new
mechanics in which the allowed occupancies of an orbital are 0, 1, and 2. The
values of the energy associated with these occupancies are assumed to be 0, $\epsilon$,
and $2\epsilon$, respectively.

(a) Derive an expression for the ensemble average occupancy $\langle N\rangle$, when the
system composed of this orbital is in thermal and diffusive contact with a
reservoir at temperature $\tau$ and chemical potential $\mu$.

(b) Return now to the usual quantum mechanics, and derive an expression
for the ensemble average occupancy of an energy level which is doubly
degenerate; that is, two orbitals have the identical energy $\epsilon$. If both orbitals are
occupied the total energy is $2\epsilon$.


4. Energy of gas of extreme relativistic particles. Extreme relativistic
particles have momenta p such that $pc \gg Mc^2$, where M is the rest mass of the
particle. The de Broglie relation $\lambda = h/p$ for the quantum wavelength continues
to apply. Show that the mean energy per particle of an extreme relativistic ideal
gas is $3\tau$ if $\epsilon \cong pc$, in contrast to $\frac{3}{2}\tau$ for the nonrelativistic problem. (An interesting
variety of relativistic problems are discussed by E. Fermi in Notes on
Thermodynamics and Statistics, University of Chicago Press, 1966, paperback.)


5. Integration of the thermodynamic identity for an ideal gas. From the
thermodynamic identity at constant number of particles we have

$$ d\sigma = \frac{dU}{\tau} + \frac{p\,dV}{\tau} = \frac{1}{\tau}\left(\frac{\partial U}{\partial\tau}\right)_V d\tau + \frac{1}{\tau}\left(\frac{\partial U}{\partial V}\right)_\tau dV + \frac{p\,dV}{\tau} . \qquad (75) $$

Show by integration that for an ideal gas the entropy is

$$ \sigma = C_V\log\tau + N\log V + \sigma_1 , \qquad (76) $$

where $\sigma_1$ is a constant, independent of $\tau$ and V.


6. Entropy of mixing. Suppose that a system of N atoms of type A is placed 
in diffusive contact with a system of N atoms of type B at the same temperature 
and volume. Show that after diffusive equilibrium is reached the total entropy 
is increased by 2N log 2. The entropy increase 2N log 2 is known as the entropy 
of mixing. If the atoms are identical (A = B), show that there is no increase in 
entropy when diffusive contact is established. The difference in the results has 
been called the Gibbs paradox. 


7. Relation of pressure and energy density. (a) Show that the average
pressure in a system in thermal contact with a heat reservoir is given by

$$ p = -\frac{\sum_s (\partial\epsilon_s/\partial V)_N \exp(-\epsilon_s/\tau)}{Z} , \qquad (77) $$

where the sum is over all states of the system. (b) Show for a gas of free particles
that

$$ \left(\frac{\partial\epsilon_s}{\partial V}\right)_N = -\frac{2}{3}\frac{\epsilon_s}{V} , \qquad (78) $$

as a result of the boundary conditions of the problem. The result holds equally
whether $\epsilon_s$ refers to a state of N noninteracting particles or to an orbital. (c) Show
that for a gas of free nonrelativistic particles

$$ p = 2U/3V , \qquad (79) $$

where U is the thermal average energy of the system. This result is not limited
to the classical regime; it holds equally for fermion and boson particles, as
long as they are nonrelativistic.


8. Time for a large fluctuation. We quoted Boltzmann to the effect that two
gases in a 0.1 liter container will unmix only in a time enormously long compared
to $10^{(10^{10})}$ years. We shall investigate a related problem: we let a gas of atoms of
$^4$He occupy a container of volume of 0.1 liter at 300 K and a pressure of 1 atm,
and we ask how long it will be before the atoms assume a configuration in
which all are in one-half of the container.

(a) Estimate the number of states accessible to the system in this initial
condition.

(b) The gas is compressed isothermally to a volume of 0.05 liter. How many
states are accessible now?

(c) For the system in the 0.1 liter container, estimate the value of the ratio

    (number of states for which all atoms are in one-half of the volume)
    / (number of states for which the atoms are anywhere in the volume).

(d) If the collision rate of an atom is $\approx 10^{10}\ \mathrm{s^{-1}}$, what is the total number of
collisions of all atoms in the system in a year? We use this as a crude estimate
of the frequency with which the state of the system changes.

(e) Estimate the number of years you would expect to wait before all atoms 
are in one-half of the volume, starting from the equilibrium configuration. 


9. Gas of atoms with internal degree of freedom. Consider an ideal
monatomic gas, but one for which the atom has two internal energy states, one an
energy $\Delta$ above the other. There are N atoms in volume V at temperature $\tau$.
Find the (a) chemical potential; (b) free energy; (c) entropy; (d) pressure;
(e) heat capacity at constant pressure.


10. Isentropic relations of ideal gas. (a) Show that the differential changes for
an ideal gas in an isentropic process satisfy

$$ \frac{dp}{p} + \gamma\frac{dV}{V} = 0 ; \qquad \frac{d\tau}{\tau} + (\gamma - 1)\frac{dV}{V} = 0 ; \qquad \frac{dp}{p} + \frac{\gamma}{1 - \gamma}\frac{d\tau}{\tau} = 0 , \qquad (80) $$

where $\gamma = C_p/C_V$; these relations apply even if the molecules have internal
degrees of freedom. (b) The isentropic and isothermal bulk moduli are defined as

$$ B_\sigma = -V(\partial p/\partial V)_\sigma ; \qquad B_\tau = -V(\partial p/\partial V)_\tau . \qquad (81) $$

Show that for an ideal gas $B_\sigma = \gamma p$; $B_\tau = p$. The velocity of sound in a gas is
given by $c = (B_\sigma/\rho)^{1/2}$; there is very little heat transfer in a sound wave. For an
ideal gas of molecules of mass M we have $p = \rho\tau/M$, so that $c = (\gamma\tau/M)^{1/2}$.
Here $\rho$ is the mass density.


11. Convective isentropic equilibrium of the atmosphere. The lower 10-15 km
of the atmosphere (the troposphere) is often in a convective steady state
at constant entropy, not constant temperature. In such equilibrium $pV^\gamma$ is
independent of altitude, where $\gamma = C_p/C_V$. Use the condition of mechanical
equilibrium in a uniform gravitational field to: (a) Show that dT/dz = constant,
where z is the altitude. This quantity, important in meteorology, is called the
dry adiabatic lapse rate. (Do not use the barometric pressure relation that was
derived in Chapter 5 for an isothermal atmosphere.) (b) Estimate dT/dz, in
°C per km. Take $\gamma = 7/5$. (c) Show that $p \propto \rho^\gamma$, where $\rho$ is the mass density.
If the actual temperature gradient is greater than the isentropic gradient, the
atmosphere may be unstable with respect to convection.


12. Ideal gas in two dimensions. (a) Find the chemical potential of an ideal
monatomic gas in two dimensions, with N atoms confined to a square of area
$A = L^2$. The spin is zero. (b) Find an expression for the energy U of the gas.
(c) Find an expression for the entropy $\sigma$. The temperature is $\tau$.


13. Gibbs sum for ideal gas. (a) With the help of $Z_N = (n_Q V)^N/N!$ from
Chapter 3, show that the Gibbs sum for an ideal gas of identical atoms
is $\mathcal{Z} = \exp(\lambda n_Q V)$. (b) Show that the probability there are N atoms in the gas
in volume V in diffusive contact with a reservoir is

$$ P(N) = \langle N\rangle^N \exp(-\langle N\rangle)/N! , \qquad (82) $$

which is just the Poisson distribution function (Appendix C). Here $\langle N\rangle$ is the
thermal average number of atoms in the volume, which we have evaluated
previously as $\langle N\rangle = \lambda V n_Q$. (c) Confirm that P(N) above satisfies

$$ \sum_N P(N) = 1 \quad \text{and} \quad \sum_N N P(N) = \langle N\rangle . $$


14. Ideal gas calculations. Consider one mole of an ideal monatomic gas at 
300K and 1 atm. First, let the gas expand isothermally and reversibly to twice 
the initial volume; second, let this be followed by an isentropic expansion from 
twice to four times the initial volume. (a) How much heat (in joules) is added to 
the gas in each of these two processes? (b) What is the temperature at the end of 
the second process? Suppose the first process is replaced by an irreversible 
expansion into a vacuum, to a total volume twice the initial volume. (c) What 
is the increase of entropy in the irreversible expansion, in joules per kelvin? 


15. Diesel engine compression. A diesel engine is an internal combustion 
engine in which fuel is sprayed into the cylinders after the air charge has been 
so highly compressed that it has attained a temperature sufficient to ignite the 
fuel. Assume that the air in the cylinders is compressed isentropically from an 
initial temperature of 27°C (300 K). If the compression ratio is 15, what is the 
maximum temperature in °C to which the air is heated by the compression? 
Take $\gamma = 1.4$.

It is a fundamental result of quantum theory that all particles, including atoms
and molecules, are either fermions or bosons. They behave alike in the classical
regime in which the concentration is small in comparison with the quantum
concentration,

$$ n \ll n_Q \equiv (M\tau/2\pi\hbar^2)^{3/2} . \qquad (1) $$

Whenever $n > n_Q$ the gas is said to be in the quantum regime and is called a
quantum gas. The difference in physical properties between a quantum gas of
fermions and one of bosons is dramatic, and both are unlike a gas in the classical
regime. A Fermi gas or liquid has a high kinetic energy, low heat capacity, low
magnetic susceptibility, low interparticle collision rate, and exerts a high
pressure on the container, even at absolute zero. A Bose gas or liquid has a
high concentration of particles in the ground orbital, and these particles
(called the Bose condensate) may act as a superfluid, with practically zero
viscosity.

For many systems the concentration n is fixed, and the temperature is the
important variable. The quantum regime obtains when the temperature is
below

$$ \tau_0 \equiv (2\pi\hbar^2/M)\,n^{2/3} , \qquad (2) $$

defined by the condition $n = n_Q$. A gas in the quantum regime with $\tau \ll \tau_0$
is often said to be a degenerate gas*.
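To get a feeling for the scale set by (2), the sketch below evaluates $\tau_0/k_B$ in kelvin for two cases: conduction electrons at a concentration typical of a copper-like metal, and $^4$He gas at 300 K and 1 atm. The concentrations, masses, and function name are illustrative inputs assumed here, not values taken from the text.

```python
import math

K_B = 1.380649e-23      # Boltzmann constant, J/K (CODATA value, assumed here)
HBAR = 1.054571817e-34  # reduced Planck constant, J s


def degeneracy_temperature_k(mass_kg, concentration_m3):
    """Eq. (2) expressed in kelvin: T_0 = (2 pi hbar^2 / M k_B) n^(2/3)."""
    return 2.0 * math.pi * HBAR**2 / (mass_kg * K_B) * concentration_m3 ** (2.0 / 3.0)


# Illustrative inputs (assumptions of this example).
M_ELECTRON = 9.1093837e-31            # electron mass, kg
M_HELIUM4 = 4.0026 * 1.66053907e-27   # mass of a 4He atom, kg
n_metal = 8.5e28                      # conduction electrons per m^3, copper-like metal
n_gas = 101325.0 / (K_B * 300.0)      # ideal gas concentration at 1 atm and 300 K

t0_electrons = degeneracy_temperature_k(M_ELECTRON, n_metal)
t0_helium = degeneracy_temperature_k(M_HELIUM4, n_gas)
print(f"electrons in a metal: T_0 ~ {t0_electrons:.2e} K")
print(f"4He gas at 1 atm:     T_0 ~ {t0_helium:.2e} K")
```

With these inputs the electrons are deep in the quantum regime at room temperature, while the helium gas is far inside the classical regime.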

It was realized by Nernst that the entropy of a classical gas diverges as $\log\tau$
as $\tau \to 0$. Quantum theory removes the difficulty: both fermion and boson
gases approach a unique ground state as $\tau \to 0$, so that the entropy goes to
zero. We say that the entropy is squeezed out on cooling a quantum gas (see
Problems 3 and 8).

In the classical regime (Chapter 6) the thermal average number of particles
in an orbital of energy $\epsilon$ is given by

$$ f(\epsilon) = \exp[(\mu - \epsilon)/\tau] . \qquad (3) $$

With the result for $\mu$ appropriate to this regime,

$$ f(\epsilon) = (n/n_Q)\exp(-\epsilon/\tau) , \qquad (4) $$

with the usual choice of the origin of $\epsilon$ at zero for the energy of the lowest orbital.
The form (4) assures us that the average occupancy of any orbital is always
$\le n/n_Q$, which is $\ll 1$, consistent with our original picture of the classical regime.

* Here we have the second distinct usage of the word "degenerate" in statistical physics. The first
usage was introduced in Chapter 1, where we called an energy level degenerate if more than one
state has the same energy.

A fermion is any particle, elementary or composite, with a half-integral
spin. A fermion is limited by the Pauli exclusion principle to an orbital
occupancy of 0 or 1, with an average occupancy anywhere between these limits. At
low temperatures it is clear that many low-lying orbitals will have one fermion
in each orbital. At absolute zero all orbitals with $0 < \epsilon < \epsilon_F$ will be occupied
with f = 1. Here $\epsilon_F$ is the energy below which there are just enough orbitals
to hold the number of particles assigned to the system. This energy is called the
Fermi energy. Above $\epsilon_F$ all orbitals will have f = 0 at $\tau = 0$. As $\tau$ increases the
distribution function will develop a high energy tail, as in Figure 7.3.

Bosons have integral or zero spin. They may be elementary or composite;
if composite, they must be made up of an even number of elementary particles
if these have spin $\frac{1}{2}$, for there is no way to arrive at an integer from an odd
number of half-integers. The Pauli principle does not apply to bosons, so there
is no limit on the occupancy of any orbital. At absolute zero the ground
orbital (the orbital of lowest energy) is occupied by all the particles in the
system. As the temperature is increased the lowest single orbital loses its
population only slowly, and each excited orbital (any orbital of higher energy) will
contain a relatively small number of particles. We shall discuss this point
carefully. Above $\tau = \tau_E$ the ground orbital loses its special feature, and its
occupancy becomes much like that of any low-lying excited orbital.


FERMI GAS 


A Fermi gas is called degenerate when the temperature is low in comparison
with the Fermi energy. When the inequality $\tau \ll \epsilon_F$ is satisfied the orbitals of
energy lower than the Fermi energy $\epsilon_F$ will be almost entirely occupied, and the
orbitals of higher energy will be almost entirely vacant. An orbital is occupied
fully when it contains one fermion. A Fermi gas is said to be nondegenerate
when the temperature is high compared with the Fermi energy, as in the classical
regime treated in Chapter 6.

The important applications of the theory of degenerate Fermi gases include
conduction electrons in metals; the white dwarf stars; liquid $^3$He; and nuclear
matter. The most striking property of a fermion gas is the high kinetic energy
of the ground state of the system at absolute zero. Suppose that it is necessary
to accommodate N noninteracting electrons in a length L in one dimension.
What orbitals will be occupied in the ground state of the N electron system?
In a one-dimensional crystal the quantum number of a free electron orbital of
form $\sin(n\pi x/L)$ is a positive integer n, supplemented by the spin quantum
number $m_s = \pm\frac{1}{2}$ for spin up or spin down.

If the system has 8 electrons, then in the ground state the orbitals with
n = 1, 2, 3, 4 and with $m_s = \pm\frac{1}{2}$ are filled, and the orbitals of higher n are
empty. Any other arrangement gives a higher energy. To construct the ground
state we fill the orbitals starting from n = 1 at the bottom, and we continue
filling higher orbitals with electrons until all N electrons are accommodated.
The orbitals that are filled in the ground state of a system of 16 electrons are
shown in Figure 7.1.

Figure 7.1 (a) The energies of the orbitals n = 1, 2, ..., 10 for an electron
confined to a line of length L. Each level corresponds to two orbitals, one for
spin up and one for spin down. (b) The ground state of a system of 16 electrons.
Orbitals above the shaded region are vacant in the ground state.


Ground State of Fermi Gas in Three Dimensions 




Let the system be a cube of side L and volume $V = L^3$. The orbitals have the
form of (3.58) and their energy is given by (3.59). The Fermi energy $\epsilon_F$ is the
energy of the highest filled orbital at absolute zero; it is determined by the
requirement that the system in the ground state hold N electrons, with each
orbital filled with one electron up to the energy

$$ \epsilon_F = \frac{\hbar^2}{2m}\left(\frac{\pi n_F}{L}\right)^2 . \qquad (5) $$

Here $n_F$ is the radius of a sphere (in the space of the integers $n_x$, $n_y$, $n_z$) that
separates filled and empty orbitals. For the system to hold N electrons the
orbitals must be filled up to $n_F$ determined by

$$ N = 2 \times \tfrac{1}{8} \times \tfrac{4\pi}{3}n_F^3 = \tfrac{\pi}{3}n_F^3 ; \qquad n_F = (3N/\pi)^{1/3} . \qquad (6) $$

The factor 2 arises because an electron has two possible spin orientations. The
factor $\frac{1}{8}$ arises because only triplets $n_x$, $n_y$, $n_z$ in the positive octant of the sphere
in n space are to be counted. The volume of the sphere is $4\pi n_F^3/3$. We may
then write (5) as

$$ \epsilon_F = \frac{\hbar^2}{2m}\left(\frac{3\pi^2 N}{V}\right)^{2/3} \equiv \tau_F . \qquad (7) $$

This relates the Fermi energy to the electron concentration $N/V \equiv n$. The
so-called "Fermi temperature" $\tau_F$ is defined as $\tau_F \equiv \epsilon_F$.
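As a numerical illustration of (7), the sketch below evaluates the Fermi energy for an electron concentration typical of a copper-like metal and converts it to a temperature in kelvin by dividing by $k_B$. The concentration, the constants, and the function name are assumptions of this example rather than values given in the text.

```python
import math

HBAR = 1.054571817e-34     # reduced Planck constant, J s (CODATA, assumed here)
M_ELECTRON = 9.1093837e-31 # electron mass, kg
K_B = 1.380649e-23         # Boltzmann constant, J/K
EV = 1.602176634e-19       # one electron volt in joules


def fermi_energy_joule(concentration_m3, mass_kg=M_ELECTRON):
    """Eq. (7): eps_F = (hbar^2 / 2m) (3 pi^2 n)^(2/3) for spin-1/2 particles."""
    return HBAR**2 / (2.0 * mass_kg) * (3.0 * math.pi**2 * concentration_m3) ** (2.0 / 3.0)


n_example = 8.5e28   # electrons per m^3 (illustrative, copper-like)
eps_f = fermi_energy_joule(n_example)
t_f = eps_f / K_B    # Fermi temperature in conventional units
print(f"eps_F = {eps_f / EV:.2f} eV, T_F = {t_f:.3g} K")
```

A Fermi temperature of several times $10^4$ K is why conduction electrons at room temperature are treated as a degenerate gas.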
The total energy of the system in the ground state is 


3 2 
$$ U_0 = 2\sum_{n \le n_F}\epsilon_n = 2 \times \tfrac{1}{8} \times 4\pi\int_0^{n_F} dn\, n^2\epsilon_n = \frac{\pi^3}{2m}\left(\frac{\hbar}{L}\right)^2\int_0^{n_F} dn\, n^4 , \qquad (8) $$

with $\epsilon_n = (\hbar^2/2m)(\pi n/L)^2$. In (8) and (9), n is an integer and is not N/V. Consistent
with (6), we have let

$$ 2\sum_n(\cdots) \rightarrow 2(\tfrac{1}{8})(4\pi)\int dn\, n^2(\cdots) \qquad (9) $$

in the conversion of the sum into an integral. Integration of (8) gives the total
ground state kinetic energy:

$$ U_0 = \frac{\pi^3}{10m}\left(\frac{\hbar}{L}\right)^2 n_F^5 = \frac{3\hbar^2}{10m}\left(\frac{\pi n_F}{L}\right)^2 N = \tfrac{3}{5}N\epsilon_F , \qquad (10) $$

using (5) and (6). The average kinetic energy per particle is $U_0/N$ and is $\frac{3}{5}$ of
the Fermi energy $\epsilon_F$. At constant N the energy increases as the volume decreases
(Figure 7.2), so that the Fermi energy gives a repulsive contribution to the
binding of any material; in most metals and in white dwarf and neutron stars
it is the most important repulsive interaction. That is, the Fermi energy tends
to increase the volume. It is balanced in metals by the Coulomb attraction
between electrons and ions and in the stars by gravitational attraction.

Figure 7.2 Total ground state energy $U_0$ of one mole of
electrons, versus volume. (Axes: $U_0$ in ergs; volume in cm$^3$, from 0 to 100.)
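The result $U_0 = \frac{3}{5}N\epsilon_F$ comes from replacing a sum over integer triplets by an integral, so it can be checked by brute force. The sketch below fills every orbital inside the positive octant of a sphere of radius $n_F$ and forms the ratio $U_0/N\epsilon_F$. Energies are measured in units of $(\hbar^2/2m)(\pi/L)^2$; the value $n_F = 40$ and the function name are arbitrary choices for the example.

```python
def ground_state_by_counting(n_f):
    """Fill orbitals (nx, ny, nz >= 1) with nx^2 + ny^2 + nz^2 <= n_f^2, two spins each.

    Energies are in units of (hbar^2 / 2m)(pi / L)^2, so eps_n = n^2 and eps_F = n_f^2.
    Returns (number of electrons N, total energy U_0).
    """
    limit = n_f * n_f
    n_electrons = 0
    energy = 0
    for nx in range(1, n_f + 1):
        for ny in range(1, n_f + 1):
            for nz in range(1, n_f + 1):
                n_sq = nx * nx + ny * ny + nz * nz
                if n_sq <= limit:
                    n_electrons += 2      # spin up and spin down
                    energy += 2 * n_sq
    return n_electrons, energy


n_f = 40
n_electrons, u0 = ground_state_by_counting(n_f)
ratio = u0 / (n_electrons * n_f**2)   # U_0 / (N eps_F); Eq. (10) predicts 3/5
print(n_electrons, ratio)
```

For a finite $n_F$ the ratio sits slightly above 0.6 because orbitals near the coordinate planes are missing from the count; the deviation falls off roughly as $1/n_F$.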


Density of States 


Thermal averages for independent particle problems have the form

$$ \langle X\rangle = \sum_n f(\epsilon_n, \tau, \mu)X_n , \qquad (11) $$

where n denotes the quantum orbital; $X_n$ is the value of the quantity X in
the orbital n; and $f(\epsilon_n, \tau, \mu)$ is the thermal average occupancy, called the
distribution function, of the orbital n. We often express $\langle X\rangle$ as an integral over the
orbital energy $\epsilon$. Then (11) becomes

$$ \langle X\rangle = \int d\epsilon\, \mathcal{D}(\epsilon) f(\epsilon, \tau, \mu) X(\epsilon) , \qquad (12) $$

where the sum over orbitals has been transformed to an integral by the
substitution

$$ \sum_n(\cdots) \rightarrow \int d\epsilon\, \mathcal{D}(\epsilon)(\cdots) . \qquad (13) $$

Here $\mathcal{D}(\epsilon)\,d\epsilon$ is the number of orbitals of energy between $\epsilon$ and $\epsilon + d\epsilon$. The
quantity $\mathcal{D}(\epsilon)$ is nearly always called the density of states, although it is more
accurate to call it the density of orbitals because it refers to the solutions of a
one particle problem and not to the states of the N particle system.
Consider an example of the calculation of $\mathcal{D}(\epsilon)$. We see from (7) that the
number N of free electron orbitals of energy less than or equal to some $\epsilon$ is

$$ N(\epsilon) = (V/3\pi^2)(2m/\hbar^2)^{3/2}\epsilon^{3/2} , \qquad (14) $$

for volume V. Take the logarithm of both sides:

$$ \log N = \tfrac{3}{2}\log\epsilon + \text{constant} , \qquad (15) $$

and take differentials of $\log N$ and $\log\epsilon$:

$$ \frac{dN}{N} = \frac{3}{2}\frac{d\epsilon}{\epsilon} . \qquad (16) $$

The quantity $dN = (3N/2\epsilon)\,d\epsilon$ is the number of orbitals of energy between $\epsilon$
and $\epsilon + d\epsilon$, so that

$$ \mathcal{D}(\epsilon) \equiv dN/d\epsilon = 3N(\epsilon)/2\epsilon \qquad (17) $$

is the density of orbitals. The two spin orientations of an electron have been
counted throughout this derivation because they were counted in (6). We can
write $\mathcal{D}(\epsilon)$ as a function of $\epsilon$ alone because

$$ N(\epsilon)/\epsilon = (V/3\pi^2)(2m/\hbar^2)^{3/2}\epsilon^{1/2} , \qquad (18) $$

from (14). Then (17) becomes

$$ \mathcal{D}(\epsilon) = \frac{V}{2\pi^2}\left(\frac{2m}{\hbar^2}\right)^{3/2}\epsilon^{1/2} . \qquad (19) $$

Figure 7.3 Density of orbitals as a function
of energy, for a free electron gas in three
dimensions. The dashed curve represents the
density $f(\epsilon)\mathcal{D}(\epsilon)$ of occupied orbitals at a finite
temperature, but such that $\tau$ is small in
comparison with $\epsilon_F$. The shaded area represents
the occupied orbitals at absolute zero. (Axes: density of orbitals versus energy $\epsilon$.)

When multiplied by the distribution function (Figure 6.3), the density of
orbitals $\mathcal{D}(\epsilon)$ becomes $\mathcal{D}(\epsilon)f(\epsilon)$, the density of occupied orbitals (Figure 7.3). The
total number of electrons in a system may now be written as

$$ N = \int_0^\infty d\epsilon\, \mathcal{D}(\epsilon) f(\epsilon, \tau, \mu) , \qquad (20) $$

where $f(\epsilon)$ is the Fermi-Dirac distribution function described in Chapter 6. In
problems where we know the total number of particles, we determine $\mu$ by
requiring that the total number of particles calculated from (20) be equal to the
correct value. The total kinetic energy of the electrons is

$$ U = \int_0^\infty d\epsilon\, \epsilon\,\mathcal{D}(\epsilon) f(\epsilon, \tau, \mu) . \qquad (21) $$

If the system is in the ground state, all orbitals are filled up to the energy $\epsilon_F$,
above which they are vacant. The number of electrons is equal to

$$ N = \int_0^{\epsilon_F} d\epsilon\, \mathcal{D}(\epsilon) , \qquad (22) $$

and the energy is

$$ U_0 = \int_0^{\epsilon_F} d\epsilon\, \epsilon\,\mathcal{D}(\epsilon) . \qquad (23) $$
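The ground state integrals (22) and (23) can be checked numerically. The sketch below writes the density of orbitals in the form $\mathcal{D}(\epsilon) = (3N/2\epsilon_F)(\epsilon/\epsilon_F)^{1/2}$, which follows from (14) and (17) once $N(\epsilon_F) = N$ is imposed, and integrates with a plain midpoint rule in units where $N = 1$ and $\epsilon_F = 1$. The function names, the choice of units, and the number of integration steps are assumptions of this example.

```python
import math


def density_of_orbitals(eps, n_total=1.0, eps_f=1.0):
    """Eq. (17) with N(eps) = N (eps/eps_F)^(3/2): D = (3N / 2 eps_F) sqrt(eps/eps_F)."""
    return 1.5 * n_total / eps_f * math.sqrt(eps / eps_f)


def midpoint_integral(func, lower, upper, steps=20000):
    """Plain midpoint rule; adequate for the integrands used in this example."""
    width = (upper - lower) / steps
    return width * sum(func(lower + (i + 0.5) * width) for i in range(steps))


n_check = midpoint_integral(density_of_orbitals, 0.0, 1.0)                     # Eq. (22)
u0_check = midpoint_integral(lambda e: e * density_of_orbitals(e), 0.0, 1.0)   # Eq. (23)
print(n_check, u0_check)
```

The first integral returns the particle number and the second returns three fifths of $N\epsilon_F$, in agreement with (10).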


Heat Capacity of Electron Gas 




We derive a quantitative expression for the heat capacity of a degenerate Fermi
gas of electrons in three dimensions. The calculation is perhaps the most
impressive accomplishment of the theory of the degenerate Fermi gas. For an
ideal monatomic gas the heat capacity is $\frac{3}{2}N$, but for electrons in a metal very
much lower values are found. The calculation that follows gives excellent
agreement with the experimental results. The increase in the total energy of a system
of N electrons when heated from 0 to $\tau$ is denoted by $\Delta U \equiv U(\tau) - U(0)$,
whence

$$ \Delta U = \int_0^\infty d\epsilon\, \epsilon\,\mathcal{D}(\epsilon) f(\epsilon) - \int_0^{\epsilon_F} d\epsilon\, \epsilon\,\mathcal{D}(\epsilon) . \qquad (24) $$

Here $f(\epsilon)$ is the Fermi-Dirac function, and $\mathcal{D}(\epsilon)$ is the number of orbitals per
unit energy range. We multiply the identity

$$ N = \int_0^\infty d\epsilon\, f(\epsilon)\mathcal{D}(\epsilon) = \int_0^{\epsilon_F} d\epsilon\, \mathcal{D}(\epsilon) \qquad (25) $$

by $\epsilon_F$ to obtain

$$ \left(\int_0^{\epsilon_F} + \int_{\epsilon_F}^\infty\right) d\epsilon\, \epsilon_F f(\epsilon)\mathcal{D}(\epsilon) = \int_0^{\epsilon_F} d\epsilon\, \epsilon_F\mathcal{D}(\epsilon) . \qquad (26) $$

We use (26) to rewrite (24) as

$$ \Delta U = \int_{\epsilon_F}^\infty d\epsilon\, (\epsilon - \epsilon_F) f(\epsilon)\mathcal{D}(\epsilon) + \int_0^{\epsilon_F} d\epsilon\, (\epsilon_F - \epsilon)[1 - f(\epsilon)]\mathcal{D}(\epsilon) . \qquad (27) $$

The first integral on the right-hand side of (27) gives the energy needed to take
electrons from $\epsilon_F$ to the orbitals of energy $\epsilon > \epsilon_F$, and the second integral gives
the energy needed to bring the electrons to $\epsilon_F$ from orbitals below $\epsilon_F$. Both
contributions to the energy are positive. The product $f(\epsilon)\mathcal{D}(\epsilon)\,d\epsilon$ in the first
integral is the number of electrons elevated to orbitals in the energy range
$d\epsilon$ at an energy $\epsilon$. The factor $[1 - f(\epsilon)]$ in the second integral is the probability
that an electron has been removed from an orbital $\epsilon$. The function $\Delta U$ is plotted
in Figure 7.4. In Figure 7.5 we plot the Fermi-Dirac distribution function versus
$\epsilon$, for six values of the temperature. The electron concentration of the Fermi
gas was taken such that $\epsilon_F/k_B = 50000$ K, characteristic of the conduction
electrons in a metal.

The heat capacity of the electron gas is found on differentiating $\Delta U$ with
respect to $\tau$. The only temperature-dependent term in (27) is $f(\varepsilon)$, whence we
can group terms to obtain

$$C_{\text{el}} = \frac{dU}{d\tau} = \int_0^\infty d\varepsilon\, (\varepsilon - \varepsilon_F)\frac{df}{d\tau}\mathcal{D}(\varepsilon). \tag{28}$$

Figure 7.4 Temperature dependence of the energy of a noninteracting fermion gas
in three dimensions. The energy is plotted in normalized form as $\Delta U/N\varepsilon_F$,
where $N$ is the number of electrons. The temperature is plotted as $\tau/\varepsilon_F$.

Figure 7.5 Fermi-Dirac distribution function at various temperatures, for
$T_F = \varepsilon_F/k_B = 50\,000$ K. The results apply to a gas in three dimensions. The total
number of particles is constant, independent of temperature. The chemical potential at
each temperature was calculated with the help of Eq. (20) and may be read off the
graph as the energy at which $f = \tfrac{1}{2}$. Horizontal axis: $\varepsilon/k_B$, in units of $10^4$ K.
Courtesy of B. Feldman.

Figure 7.6 Plot of the chemical potential $\mu$ versus temperature $\tau$ for a
gas of noninteracting fermions in three dimensions. For convenience in
plotting, the units of $\mu$ and $\tau$ are $0.763\,\varepsilon_F$. The plot marks the region of
degenerate quantum gas and the region of classical gas.


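At the temperatures of interest in metals $\tau/\varepsilon_F < 0.01$, and we see from Figure 7.5
that the derivative $df/d\tau$ is large only at energies near $\varepsilon_F$. It is a good
approximation to evaluate the density of orbitals $\mathcal{D}(\varepsilon)$ at $\varepsilon_F$ and take it outside of the
integral:

$$C_{\text{el}} \cong \mathcal{D}(\varepsilon_F)\int_0^\infty d\varepsilon\, (\varepsilon - \varepsilon_F)\frac{df}{d\tau}. \tag{29}$$

Examination of the graphs in Figures 7.6 and 7.7 of the variation of $\mu$ with $\tau$
suggests that when $\tau \ll \varepsilon_F$ we ignore the temperature dependence of the chemical
potential $\mu$ in the Fermi-Dirac distribution function and replace $\mu$ by the
constant $\varepsilon_F$. We have then:

$$\frac{df}{d\tau} = \frac{\varepsilon - \varepsilon_F}{\tau^2}\cdot\frac{\exp[(\varepsilon - \varepsilon_F)/\tau]}{\{\exp[(\varepsilon - \varepsilon_F)/\tau] + 1\}^2}. \tag{30}$$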


Figure 7.7 Variation with temperature of the chemical potential $\mu$, for free
electron Fermi gases in one and three dimensions. In common metals
$\tau/\varepsilon_F \approx 0.01$ at room temperature, so that $\mu$ is closely equal to $\varepsilon_F$. These
curves were calculated from series expansions of the integral for the number of
particles in the system. Axes: $\mu/\varepsilon_F$ versus $\tau/\varepsilon_F$; the curves are labeled
"One dimension" and "Three dimensions".

We set

$$x \equiv (\varepsilon - \varepsilon_F)/\tau, \tag{31}$$

and it follows from (29) and (30) that

$$C_{\text{el}} = \tau\mathcal{D}(\varepsilon_F)\int_{-\varepsilon_F/\tau}^{\infty} dx\, x^2\frac{e^x}{(e^x + 1)^2}. \tag{32}$$

We may safely replace the lower limit by $-\infty$ because the factor $e^x$ in the
integrand is already negligible at $x = -\varepsilon_F/\tau$ if we are concerned with low
temperatures such that $\varepsilon_F/\tau \sim 100$ or more. The integral* becomes

$$\int_{-\infty}^{\infty} dx\, x^2\frac{e^x}{(e^x + 1)^2} = \frac{\pi^2}{3}, \tag{33}$$

whence we have for the heat capacity of an electron gas, when $\tau \ll \tau_F$,

$$C_{\text{el}} = \tfrac{1}{3}\pi^2\mathcal{D}(\varepsilon_F)\tau. \tag{34}$$

In conventional units,

$$C_{\text{el}} = \tfrac{1}{3}\pi^2\mathcal{D}(\varepsilon_F)k_B^2 T. \tag{35}$$

* The integral is not elementary, but may be evaluated from the more familiar result

$$\int_0^\infty dx\, \frac{x}{e^{ax} + 1} = \frac{\pi^2}{12a^2}$$

on differentiation of both sides with respect to the parameter $a$.


We found that the density of orbitals at the Fermi energy is

$$\mathcal{D}(\varepsilon_F) = 3N/2\varepsilon_F = 3N/2\tau_F \tag{36}$$

for a free electron gas, with $\tau_F \equiv \varepsilon_F$. Do not be deceived by the notation $\tau_F$:
it is not the temperature of the Fermi gas, but only a convenient reference point.
For $\tau \ll \tau_F$ the gas is degenerate; for $\tau \gg \tau_F$ the gas is in the classical regime.
Thus (34) becomes

$$C_{\text{el}} = \tfrac{1}{2}\pi^2 N\tau/\tau_F. \tag{37}$$

In conventional units there is an extra factor $k_B$, so that

$$C_{\text{el}} = \tfrac{1}{2}\pi^2 N k_B T/T_F, \tag{38}$$

where $k_B T_F \equiv \varepsilon_F$. Again, $T_F$ is not an actual temperature, but only a reference
point.
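As a numerical cross-check of the integral (33) and an illustration of the size of (38), the following sketch integrates $x^2 e^x/(e^x+1)^2$ with a composite Simpson rule and then evaluates the electronic heat capacity of one mole of electrons. The cutoff at $|x| = 50$, the function names, and the sample values $T = 300$ K and $T_F = 5 \times 10^4$ K are assumptions chosen for illustration.

```python
import math

def integrand(x):
    """x^2 e^x / (e^x + 1)^2, rewritten as x^2 / (4 cosh^2(x/2)) to avoid overflow."""
    return x * x / (4.0 * math.cosh(0.5 * x) ** 2)

def simpson(func, lower, upper, intervals=4000):
    """Composite Simpson rule; intervals must be even."""
    width = (upper - lower) / intervals
    total = func(lower) + func(upper)
    for i in range(1, intervals):
        total += (4 if i % 2 else 2) * func(lower + i * width)
    return total * width / 3.0

# The integrand decays like x^2 exp(-|x|), so +/-50 stands in for +/-infinity.
integral = simpson(integrand, -50.0, 50.0)
print(f"numerical integral = {integral:.8f}, pi^2/3 = {math.pi ** 2 / 3:.8f}")

def electronic_heat_capacity_per_mole(temperature_k, fermi_temperature_k):
    """Eq. (38) for one mole: C_el = (pi^2 / 2) R T / T_F, in J mol^-1 K^-1."""
    gas_constant = 8.314  # J mol^-1 K^-1, equal to N_A * k_B
    return 0.5 * math.pi ** 2 * gas_constant * temperature_k / fermi_temperature_k

c_el = electronic_heat_capacity_per_mole(300.0, 5.0e4)
c_classical = 1.5 * 8.314  # ideal monatomic gas value, (3/2) R
print(f"C_el = {c_el:.3f} J/(mol K), classical value = {c_classical:.2f} J/(mol K)")
```

With these sample values the electronic heat capacity comes out at a few percent of the classical $\tfrac{3}{2}R$, which is the "very much lower" value the text refers to.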

We can give a physical explanation of the form of the result (37). When the
specimen is heated from absolute zero, chiefly those electrons in states within
an energy range $\tau$ of the Fermi level are excited thermally, because the FD
distribution function is affected over a region of the order of $\tau$ in width,
illustrated by Figures 7.3 and 7.5. Thus the number of excited electrons is of the order
of $N\tau/\varepsilon_F$, and each of these has its energy increased approximately by $\tau$. The
total electronic thermal energy is therefore of the order of $U_{\text{el}} \approx N\tau^2/\varepsilon_F$. Thus
the electronic contribution to the heat capacity is given by

$$C_{\text{el}} = dU_{\text{el}}/d\tau \approx N\tau/\varepsilon_F = N\tau/\tau_F, \tag{39}$$

which is directly proportional to $\tau$, in agreement with the exact result (34) and
with the experimental results.

Table 7.1 Calculated Fermi energy parameters for free electrons

        Conduction electron    Velocity          Fermi energy    Fermi temperature
        concentration N/V,     v_F,              ε_F,            T_F = ε_F/k_B,
Metal   in 10^22 cm^-3         in 10^8 cm s^-1   in eV           in 10^4 K

Li      4.6                    1.3               4.7             5.5
Na      2.5                    1.1               3.1             3.7
K       1.34                   0.85              2.1             2.4
Rb      1.08                   0.79              1.8             2.1
Cs      0.86                   0.73              1.5             1.8
Cu      8.50                   1.56              7.0             8.2
Ag      5.76                   1.38              5.5             6.4
Au      5.90                   1.39              5.5             6.4


Fermi Gas in Metals 


The alkali metals and copper, silver, and gold have one valence electron per 
atom, and the valence electron becomes the conduction electron in the metal. 
Thus the concentration of conduction electrons is equal to the concentration 
of atoms, which may be evaluated either from the density and the atomic 
weight or from the crystal lattice dimensions. 

If the conduction electrons act as a free fermion gas, the value of the Fermi
energy $\varepsilon_F$ may be calculated from (7):

$$\varepsilon_F = (\hbar^2/2m)(3\pi^2 n)^{2/3}. \tag{40}$$

Values of $n$ and of $\varepsilon_F$ are given in Table 7.1 and in Figure 7.8. The electron
velocity $v_F$ at the Fermi surface is also given in the table; it is defined so that
the kinetic energy is equal to $\varepsilon_F$:

$$\tfrac{1}{2}mv_F^2 = \varepsilon_F, \tag{41}$$

where $m$ is the mass of the electron. The values of the Fermi temperature
$T_F = \varepsilon_F/k_B$ for ordinary metals are of the order of $5 \times 10^4$ K, so that the assumption
$T \ll T_F$ used in the derivation of (35) is an excellent approximation at room
temperature and below.
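Equations (40) and (41) are easy to evaluate directly. The sketch below computes $\varepsilon_F$, $v_F$, and $T_F$ from the conduction electron concentration in CGS units, for three of the concentrations listed in Table 7.1. The rounded physical constants and the function name are assumptions introduced for the illustration.

```python
import math

HBAR = 1.0546e-27        # erg s
M_ELECTRON = 9.109e-28   # g
K_B = 1.3807e-16         # erg K^-1
ERG_PER_EV = 1.602e-12   # erg per electron volt

def fermi_parameters(concentration_cm3):
    """Return (eps_F in eV, v_F in cm/s, T_F in K) from Eqs. (40) and (41)."""
    eps_f_erg = (HBAR ** 2 / (2.0 * M_ELECTRON)
                 * (3.0 * math.pi ** 2 * concentration_cm3) ** (2.0 / 3.0))
    v_f = math.sqrt(2.0 * eps_f_erg / M_ELECTRON)
    return eps_f_erg / ERG_PER_EV, v_f, eps_f_erg / K_B

# Concentrations in cm^-3, taken from Table 7.1.
for metal, n in [("Na", 2.5e22), ("K", 1.34e22), ("Cu", 8.50e22)]:
    eps_ev, v_f, t_f = fermi_parameters(n)
    print(f"{metal}: eps_F = {eps_ev:.1f} eV, v_F = {v_f:.2e} cm/s, T_F = {t_f:.1e} K")
```

The printed values can be compared line by line with the corresponding rows of Table 7.1.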

Figure 7.8 Fermi energy $\varepsilon_F$ of a free electron gas as a function of the
concentration. Calculated values are shown for several monovalent metals. The
straight line is drawn for $\varepsilon_F = 5.835 \times 10^{-27}\, n^{2/3}$ ergs, with $n$ in cm$^{-3}$.
Vertical axis: $\varepsilon_F$, in ergs.

Figure 7.9 Experimental heat capacity values for potassium, plotted as $C/T$ versus
$T^2$. The line drawn is $C/T = 2.08 + 2.57\,T^2$, with $C/T$ in mJ mol$^{-1}$ K$^{-2}$.
After W. H. Lien and N. E. Phillips, Phys. Rev. 133, A1370 (1964).

The heat capacity of many metals at constant volume may be written as
the sum of an electronic contribution and a lattice vibration contribution. At
low temperatures the sum has the form

$$C_V = \gamma\tau + A\tau^3, \tag{42}$$

where $\gamma$ and $A$ are constants characteristic of the material. Here $\gamma \equiv \tfrac{1}{2}\pi^2 N/\tau_F$
from (37), and the lattice vibration term $A\tau^3$ was discussed in Chapter 4. The
electronic term is linear in $\tau$ and is dominant at sufficiently low temperatures.
It is helpful to display the experimental values of the heat capacity for a given
material as a plot of $C_V/\tau$ versus $\tau^2$:

$$C_V/\tau = \gamma + A\tau^2, \tag{43}$$

for then the points should lie on a straight line. The intercept at $\tau = 0$ gives
the value of $\gamma$. Such a plot is shown for potassium in Figure 7.9. Observed values
of $\gamma$ are given in Tables 7.2 and 7.3.
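The straight-line analysis of (43) amounts to a least-squares fit of $C/T$ against $T^2$. The sketch below shows the procedure on synthetic points generated from the line $C/T = 2.08 + 2.57\,T^2$ quoted with Figure 7.9. These points are not measurements; they are an assumption made only so that the fit has something to recover, and the function name is hypothetical.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit y = intercept + slope * x; returns (intercept, slope)."""
    count = len(xs)
    mean_x = sum(xs) / count
    mean_y = sum(ys) / count
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
             / sum((x - mean_x) ** 2 for x in xs))
    return mean_y - slope * mean_x, slope

# Synthetic data (NOT experimental): C = gamma*T + A*T^3 with the Figure 7.9 line.
temperatures = [0.1 * k for k in range(1, 6)]                         # K
heat_capacities = [2.08 * t + 2.57 * t ** 3 for t in temperatures]    # mJ mol^-1 K^-1

# Plot variables of Eq. (43): C/T against T^2.
t_squared = [t ** 2 for t in temperatures]
c_over_t = [c / t for c, t in zip(heat_capacities, temperatures)]

gamma, a_coeff = fit_line(t_squared, c_over_t)
print(f"gamma = {gamma:.2f} mJ mol^-1 K^-2, A = {a_coeff:.2f} mJ mol^-1 K^-4")
```

With real data the intercept gives the electronic constant $\gamma$ and the slope gives the lattice coefficient $A$; scatter in the points would show up as a residual about the fitted line.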


Chapter 7: Fermi and Bose Gases 


Table 7.2 Experimental and free electron electronic
heat capacities of monovalent metals

         γ (exp),            γ0 (free electron),
Metal    mJ mol^-1 K^-2      mJ mol^-1 K^-2         γ/γ0

Li       1.63                0.75                   2.17
Na       1.38                1.14                   1.21
K        2.08                1.69                   1.23
Rb       2.41                1.97                   1.22
Cs       3.20                2.36                   1.35
Cu       0.695               0.50                   1.39
Ag       0.646               0.65                   1.00
Au       0.729               0.65                   1.13

NOTE: The values of γ and γ0 are in mJ mol^-1 K^-2.
SOURCE: Courtesy of N. E. Phillips.


Table 7.3 Experimental values of electronic heat capacity constant γ of metals

Li 1.63 | Be 0.17 | B | C | N
Na 1.38 | Mg 1.3 | Al 1.35 | Si | P
K 2.08 | Ca 2.9 | Sc 10.7 | Ti 3.35 | V 9.26 | Cr 1.40 | Mn(γ) 9.20 | Fe 4.98 | Co 4.73 | Ni 7.02 | Cu 0.695 | Zn 0.64 | Ga 0.596 | Ge | As 0.19
Rb 2.41 | Sr 3.6 | Y 10.2 | Zr 2.80 | Nb 7.79 | Mo 2.0 | Tc - | Ru 3.3 | Rh 4.9 | Pd 9.42 | Ag 0.646 | Cd 0.688 | In 1.69 | Sn 1.78 | Sb 0.11
Cs 3.20 | Ba 2.7 | La 10. | Hf 2.16 | Ta 5.9 | W 1.3 | Re 2.3 | Os 2.4 | Ir 3.1 | Pt 6.8 | Au 0.729 | Hg 1.79 | Tl 1.47 | Pb 2.98 | Bi 0.008

NOTE: The value of γ is in mJ mol^-1 K^-2. Elements are arranged as in the periodic table; an element listed without a value has no entry.
SOURCE: From compilations furnished by N. E. Phillips and N. Pearlman.


White Dwarf Stars 
White dwarf stars have masses comparable to that of the Sun. The mass and
radius of the Sun are

$$M_\odot = 2.0 \times 10^{33}\ \text{g}; \qquad R_\odot = 7.0 \times 10^{10}\ \text{cm}. \tag{44}$$

The radii of white dwarfs are very small, perhaps 0.01 that of the Sun. The
density of the Sun, which is a normal star, is of the order of 1 g cm$^{-3}$, like that
of water on the Earth. The densities of white dwarfs are exceedingly high, of the
order of $10^4$ to $10^7$ g cm$^{-3}$. Atoms under the densities prevalent in white dwarfs
are entirely ionized into nuclei and free electrons, and the electron gas is a
degenerate gas, as will be shown below.

The companion of Sirius was the first white dwarf to be discovered. In 1844
Bessel observed that the path of the star Sirius oscillated slightly about a straight
line, as if it had an invisible companion. The companion, Sirius B, was discovered
near its predicted position by Clark in 1862. The mass of Sirius B was determined
to be $2.0 \times 10^{33}$ g by measurements on the orbits. The radius of Sirius B is
estimated as $2 \times 10^{9}$ cm by a comparison of the surface temperature and the
radiant energy flux, using the properties of thermal radiant energy developed
in Chapter 4.

The mass and radius of Sirius B lead to the mean density

$$\rho = \frac{M}{V} \approx \frac{2 \times 10^{33}\ \text{g}}{(4\pi/3)(2 \times 10^{9}\ \text{cm})^3} \approx 0.7 \times 10^{5}\ \text{g cm}^{-3}. \tag{45}$$

This extraordinarily high density was appraised by Eddington in 1926 in the
following words: “Apart from the incredibility of the result, there was no
particular reason to view the calculation with suspicion.” Other white dwarfs
have higher densities: that named Van Maanen No. 2 has a mean density
100 times higher.

Hydrogen atoms at a density of $10^6$ g cm$^{-3}$ have a volume per atom equal to

$$V = \frac{1}{(10^6\ \text{mol cm}^{-3})(6 \times 10^{23}\ \text{atoms mol}^{-1})} \approx 2 \times 10^{-30}\ \text{cm}^3\ \text{per atom},$$

or $2 \times 10^{-6}$ Å$^3$ per atom. The average nearest-neighbor separation is then of
the order of 0.01 Å, as compared with the internuclear separation of 0.74 Å in a
molecule of hydrogen. Under conditions of such high density the atomic
electrons are no longer attached to individual nuclei. The electrons are ionized
and form an electron gas. The matter in the white dwarfs is held together by
gravitational attraction, which is the binding force in all stars.

In the interior of white dwarf stars* the electron gas is degenerate; the
temperature is much less than the Fermi energy $\varepsilon_F$. The Fermi energy of an
electron gas at a concentration of $1 \times 10^{30}$ electrons cm$^{-3}$ is given by

$$\varepsilon_F = (\hbar^2/2m)(3\pi^2 n)^{2/3} \approx 0.5 \times 10^{-6}\ \text{erg} \approx 3 \times 10^{5}\ \text{eV}, \tag{46}$$

about $10^5$ higher than in a typical metal. The Fermi temperature $\varepsilon_F/k_B$ of the
electrons is $\approx 3 \times 10^{9}$ K, as in Table 7.4. The actual temperature in the interior
of a white dwarf is believed to be of the order of $10^7$ K. The electron gas in the
interior of a white dwarf is highly degenerate because the thermal energy is much
lower than the Fermi energy.

* A good discussion of white dwarf stars is given by W. K. Rose, Astrophysics, Holt, Rinehart, and
Winston, 1973.

Table 7.4 Fermi energy of degenerate fermion gases
(characteristic values)

Phase of matter      Particles    T_F, in K

Liquid ^3He          atoms        0.3
Metal                electrons    5 × 10^4
White dwarf stars    electrons    3 × 10^9
Nuclear matter       nucleons     3 × 10^11
Neutron stars        neutrons     3 × 10^12

Are the electron energies in the relativistic regime? This question arises
because our theory of the Fermi gas has used the nonrelativistic expression
$p^2/2m$ for the kinetic energy of an electron of momentum $p$. The energy
equivalence of the rest mass of an electron is

$$\varepsilon_0 = mc^2 \approx (1 \times 10^{-27}\ \text{g})(3 \times 10^{10}\ \text{cm s}^{-1})^2 \approx 1 \times 10^{-6}\ \text{erg}. \tag{47}$$

This energy is of the same order as the Fermi energy (46). Thus relativistic
effects will be significant, but not dominant. At higher densities the Fermi gas
is relativistic.
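The white dwarf estimates (45) to (47) can be reproduced in a few lines. The sketch below uses the Sirius B mass and radius and the electron concentration quoted in the text; the rounded CGS constants and the function names are assumptions for the illustration.

```python
import math

HBAR = 1.0546e-27        # erg s
M_ELECTRON = 9.109e-28   # g
C_LIGHT = 2.998e10       # cm s^-1
K_B = 1.3807e-16         # erg K^-1

def mean_density(mass_g, radius_cm):
    """Mean density of a uniform sphere, as in Eq. (45), in g cm^-3."""
    return mass_g / ((4.0 * math.pi / 3.0) * radius_cm ** 3)

def fermi_energy_erg(concentration_cm3, mass_g=M_ELECTRON):
    """Nonrelativistic Fermi energy, Eq. (46), in erg."""
    return HBAR ** 2 / (2.0 * mass_g) * (3.0 * math.pi ** 2 * concentration_cm3) ** (2.0 / 3.0)

rho_sirius_b = mean_density(2.0e33, 2.0e9)
eps_f = fermi_energy_erg(1.0e30)
rest_energy = M_ELECTRON * C_LIGHT ** 2   # Eq. (47)

print(f"mean density of Sirius B: {rho_sirius_b:.1e} g cm^-3")
print(f"Fermi energy            : {eps_f:.1e} erg")
print(f"Fermi temperature       : {eps_f / K_B:.1e} K")
print(f"eps_F / (m c^2)         : {eps_f / rest_energy:.2f}")
```

The last ratio is of order unity, which is the quantitative content of the statement that relativistic effects are significant but not dominant at this concentration. Unrounded constants shift the figures slightly from the one-digit values in the text.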


Nuclear Matter 


We consider the state of matter within nuclei. The neutrons and protons of
which nuclear matter is composed form a degenerate fermion gas, at least
qualitatively. We estimate here the Fermi energy of the nucleon gas. The radius
of a nucleus that contains $A$ nucleons is given by the empirical relation

$$R \approx (1.3 \times 10^{-13}\ \text{cm}) \times A^{1/3}. \tag{48}$$

According to this relation the average volume per particle is constant, for the
volume goes as $R^3$, which is proportional to $A$. The concentration of nucleons
in nuclear matter is

$$n = \frac{A}{(4\pi/3)R^3} = \frac{1}{(4\pi/3)(1.3 \times 10^{-13}\ \text{cm})^3} \approx 0.11 \times 10^{39}\ \text{cm}^{-3}, \tag{49}$$

about $10^8$ times higher than the concentration of nucleons in a white dwarf
star. Neutrons and protons are not identical particles. The Fermi energy of
the neutrons need not equal the Fermi energy of the protons. The concentration
of one or the other, but not both, enters the familiar relation

$$\varepsilon_F = \frac{\hbar^2}{2M}(3\pi^2 n)^{2/3}. \tag{50}$$

For simplicity let us suppose that the number of protons is equal to the
number of neutrons. Then

$$n_{\text{protons}} \approx n_{\text{neutrons}} \approx 0.05 \times 10^{39}\ \text{cm}^{-3}, \tag{51}$$

as obtained from (49) on dividing by 2. The Fermi energy is

$$\varepsilon_F = (3.17 \times 10^{-30})\,n^{2/3} \approx 0.43 \times 10^{-4}\ \text{erg} \approx 27\ \text{MeV}. \tag{52}$$

The average kinetic energy of a particle in a degenerate Fermi gas is $\tfrac{3}{5}$ of the
Fermi energy, so that in nuclear matter the average kinetic energy is 16 MeV
per nucleon.
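The chain of estimates (48) to (52) can be followed numerically. The sketch below computes the nucleon concentration from the empirical radius relation and then the Fermi energy for the rounded concentration of (51). The nucleon mass value, the conversion factor, and the variable names are assumptions for the illustration.

```python
import math

HBAR = 1.0546e-27        # erg s
M_NUCLEON = 1.67e-24     # g, approximate proton or neutron mass
ERG_PER_MEV = 1.602e-6   # erg per MeV
R0 = 1.3e-13             # cm, radius constant in Eq. (48)

# Eq. (49): the factor A cancels because R^3 is proportional to A.
n_nucleons = 1.0 / ((4.0 * math.pi / 3.0) * R0 ** 3)

# Eq. (51): equal numbers of protons and neutrons, rounded as in the text.
n_one_species = 0.05e39  # cm^-3

# Eq. (50) applied to one species.
eps_f_erg = (HBAR ** 2 / (2.0 * M_NUCLEON)
             * (3.0 * math.pi ** 2 * n_one_species) ** (2.0 / 3.0))
eps_f_mev = eps_f_erg / ERG_PER_MEV
mean_kinetic_mev = 0.6 * eps_f_mev   # 3/5 of the Fermi energy

print(f"n (all nucleons)   = {n_nucleons:.2e} cm^-3")
print(f"eps_F              = {eps_f_erg:.2e} erg = {eps_f_mev:.0f} MeV")
print(f"mean kinetic energy = {mean_kinetic_mev:.0f} MeV per nucleon")
```

Using the unrounded half of (49) instead of $0.05 \times 10^{39}$ cm$^{-3}$ raises the Fermi energy by about 1 MeV, which is within the accuracy of the estimate.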


BOSON GAS AND EINSTEIN CONDENSATION 


A very remarkable effect occurs in a gas of noninteracting bosons at a certain 
transition temperature, below which a substantial fraction of the total number 
of particles in the system will occupy the single orbital of lowest energy, called 
the ground orbital. Any other orbital, including the orbital of second lowest 
energy, at the same temperature will be occupied by a relatively negligible 
number of particles. The total occupancy of all orbitals will always be equal to 
the specified number of particles in the system. The ground-orbital effect is 
called the Einstein condensation. 

There would be nothing surprising to us in this result for the ground state
occupancy if it were valid only below $10^{-14}$ K. This temperature is comparable
with the energy spacing between the lowest and next lowest orbitals in a system
of volume 1 cm$^3$, as we show below. But the Einstein condensation temperature
for a gas of fictitious noninteracting helium atoms at the observed density of
liquid helium is very much higher, about 3 K. Helium is the most familiar
example of Einstein condensation in action.


Chemical Potential Near Absolute Zero 


The key to the Einstein condensation is the behavior of the chemical potential
of a boson system at low temperatures. The chemical potential is responsible
for the apparent stabilization of a large population of particles in the ground
orbital. We consider a system composed of a large number $N$ of noninteracting
bosons. When the system is at absolute zero all particles occupy the
lowest-energy orbital and the system is in the state of minimum energy. It is certainly
not surprising that at $\tau = 0$ all particles should be in the orbital of lowest
energy. We can show that a substantial fraction remains in the ground orbital
at low, although experimentally obtainable, temperatures.

If we put the energy of the ground orbital at zero on our energy scale, then
from the Bose-Einstein distribution function

$$f(\varepsilon,\tau) = \frac{1}{\exp[(\varepsilon - \mu)/\tau] - 1} \tag{53}$$

we obtain the occupancy of the ground orbital at $\varepsilon = 0$ as

$$f(0,\tau) = \frac{1}{\exp(-\mu/\tau) - 1}. \tag{54}$$

When $\tau \to 0$ the occupancy of the ground orbital becomes equal to the total
number of particles in the system, so that

$$\lim_{\tau \to 0} f(0,\tau) = N \approx \frac{1}{\exp(-\mu/\tau) - 1} \approx \frac{1}{1 - (\mu/\tau) - 1} = -\frac{\tau}{\mu}.$$

Here we have made use of the series expansion $\exp(-x) = 1 - x + \cdots$. We
know that $x$, which is $\mu/\tau$, must be small in comparison with unity, for otherwise
the total number of particles $N$ could not be large. From this result we find

$$N = -\tau/\mu; \qquad \mu = -\tau/N, \tag{55}$$

as $\tau \to 0$. For $N = 10^{22}$ at $T = 1$ K, we have $\mu = -1.4 \times 10^{-38}$ erg. We note
from (55) that

$$\lambda \equiv \exp(\mu/\tau) \approx 1 - \frac{1}{N} \tag{56}$$

as $\tau \to 0$. The chemical potential in a boson system must always be lower in
energy than the ground state orbital, in order that the occupancy of every orbital
be non-negative.
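The numbers attached to (55) illustrate a practical point: the quantity $\exp(-\mu/\tau) - 1$ is far below double-precision resolution, so a direct evaluation of (54) fails. The sketch below evaluates $\mu = -\tau/N$ and then recovers the ground-orbital occupancy with `math.expm1`, which computes $e^x - 1$ accurately for small $x$. The constant value and function names are assumptions for the illustration.

```python
import math

K_B = 1.3807e-16  # erg K^-1

def chemical_potential_erg(temperature_k, n_particles):
    """Low-temperature limit, Eq. (55): mu = -tau / N, with tau = k_B T."""
    return -K_B * temperature_k / n_particles

def ground_occupancy(mu_erg, tau_erg):
    """Eq. (54), using expm1 so that exp(x) - 1 keeps its precision for tiny x."""
    return 1.0 / math.expm1(-mu_erg / tau_erg)

N = 1.0e22
T = 1.0  # K
tau = K_B * T
mu = chemical_potential_erg(T, N)

print(f"mu = {mu:.2e} erg")
print(f"ground-orbital occupancy from (54): {ground_occupancy(mu, tau):.3e}")
# The naive form loses everything: exp(1e-22) rounds to exactly 1.0 in double precision.
print(f"naive exp(-mu/tau) - 1 = {math.exp(-mu / tau) - 1.0}")
```

The occupancy returned by (54) equals $N$, as it must, since $\mu$ was chosen from (55).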


Example: Spacing of lowest and second lowest orbitals of free atoms. The energy of an
orbital of an atom free to move in a cube of volume $V = L^3$ is

$$\varepsilon = \frac{\hbar^2}{2M}\left(\frac{\pi}{L}\right)^2 (n_x^2 + n_y^2 + n_z^2), \tag{57}$$

where $n_x$, $n_y$, $n_z$ are positive integers. The energy $\varepsilon(111)$ of the lowest orbital is

$$\varepsilon(111) = \frac{\hbar^2}{2M}\left(\frac{\pi}{L}\right)^2 (1 + 1 + 1), \tag{58}$$

and the energy $\varepsilon(211)$ of one of the set of next lowest orbitals is

$$\varepsilon(211) = \frac{\hbar^2}{2M}\left(\frac{\pi}{L}\right)^2 (4 + 1 + 1). \tag{59}$$

The lowest excitation energy of the atom is

$$\Delta\varepsilon = \varepsilon(211) - \varepsilon(111) = 3 \times \frac{\hbar^2}{2M}\left(\frac{\pi}{L}\right)^2. \tag{60}$$

If $M(^4\text{He}) = 6.6 \times 10^{-24}$ g and $L = 1$ cm,

$$\Delta\varepsilon = (3)(8.4 \times 10^{-32})(9.86) = 2.48 \times 10^{-30}\ \text{erg}. \tag{61}$$

In temperature units, $\Delta\varepsilon/k_B = 1.80 \times 10^{-14}$ K.

This splitting is extremely small, and it is difficult to conceive that it can play an important
part in a physical problem even at the lowest reasonably accessible temperatures such as
1 mK, which is $10^{-3}$ K. However, at the 1 mK temperature (55) gives $\mu \approx -1.4 \times 10^{-41}$ erg
for $N = 10^{22}$ atoms, referred to the orbital (58) as the zero of energy. Thus $\mu$ is
much closer to the ground orbital than is the next lowest orbital (59), and
$\exp\{[\varepsilon(111) - \mu]/\tau\}$ is much closer to 1 than is $\exp\{[\varepsilon(211) - \mu]/\tau\}$, so that
$\varepsilon(111)$ dominates the distribution function.

The Boltzmann factor $\exp(-\Delta\varepsilon/\tau)$ at 1 mK is

$$\exp(-1.8 \times 10^{-11}) \approx 1 - 1.8 \times 10^{-11}, \tag{62}$$

which is essentially unity. By (4) we would expect that even if $n = n_Q$ the occupancy of the
first excited orbital would only be of the order of 1. However, the Bose-Einstein distribution
gives an entirely different value for the occupancy of the first excited orbital:

$$f(\Delta\varepsilon,\tau) = \frac{1}{\exp[(\Delta\varepsilon - \mu)/\tau] - 1} \approx \frac{1}{\exp(\Delta\varepsilon/\tau) - 1}, \tag{63}$$

because $\Delta\varepsilon \gg |\mu|$. Thus the occupation of the first excited orbital at 1 mK is

$$f \approx \frac{1}{1.8 \times 10^{-11}} \approx 5 \times 10^{10}, \tag{64}$$

so that the fraction of the $N$ particles that are in this orbital is
$f/N \approx 5 \times 10^{10}/10^{22} \approx 5 \times 10^{-12}$, which is very small. We see that the occupancy of the
first excited orbital at low temperatures is relatively very much lower than would be expected
at first sight from the simple Boltzmann factor (62). The Bose-Einstein distribution is quite
strange; it favors a situation in which the greatest part of the population is left in the ground
orbital at sufficiently low temperatures. The particles in the ground orbital, as long as their
number is $\gg 1$, are called the Bose-Einstein condensate. The atoms in the condensate act quite
differently from the atoms in excited states.

How do we understand the existence of the condensate? Suppose the atoms were
governed by the Planck distribution (Chapter 4), which makes no provision for holding
constant the total number of particles; instead, the thermal average number of photons
increases with temperature as $\tau^3$, as found in Problem 4.1. If the laws of nature restricted the
total number of photons to a value $N$, we would say that the ground orbital of the photon
gas contained the difference $N_0 = N - N(\tau)$ between the number allotted and the number
thermally excited. The $N_0$ nonexcited photons would be described as condensed into the
ground orbital, but $N_0$ becomes essentially zero at a temperature $\tau_c$ such that all $N$ photons
are excited. There is no actual constraint on the total number of photons; however, there
is a constraint on the total number $N$ of material bosons, such as $^4$He atoms, in a system.
This constraint is the origin of the condensation into the ground orbital. The difference
between the Planck distribution and the Bose-Einstein distribution is that the latter will
conserve the total number of particles, independent of temperature, so that nonexcited
atoms are really in the ground state condensate.
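The arithmetic of the example can be reproduced directly. The sketch below computes the level spacing (60), the Boltzmann factor (62), and the Bose-Einstein occupancy (63) of the first excited orbital at 1 mK for $N = 10^{22}$ atoms in a cube with $L = 1$ cm. The helium mass is the value quoted in the text; the other constants and all names are assumptions for the illustration.

```python
import math

HBAR = 1.0546e-27   # erg s
K_B = 1.3807e-16    # erg K^-1
M_HE4 = 6.6e-24     # g, value used in the text
L_BOX = 1.0         # cm
N = 1.0e22          # number of atoms
T = 1.0e-3          # K

# Eq. (60): spacing between the (111) and (211) orbitals.
delta_eps = 3.0 * HBAR ** 2 / (2.0 * M_HE4) * (math.pi / L_BOX) ** 2

tau = K_B * T
mu = -tau / N                                  # Eq. (55)

boltzmann_factor = math.exp(-delta_eps / tau)  # Eq. (62)
# Eq. (63); expm1 keeps precision because the exponent is about 1.8e-11.
occupancy = 1.0 / math.expm1((delta_eps - mu) / tau)
fraction = occupancy / N

print(f"delta_eps        = {delta_eps:.2e} erg ({delta_eps / K_B:.2e} K)")
print(f"Boltzmann factor = 1 - {1.0 - boltzmann_factor:.1e}")
print(f"occupancy (211)  = {occupancy:.1e}")
print(f"fraction f/N     = {fraction:.1e}")
```

The occupancy of the first excited orbital is large in absolute terms, yet it is a negligible fraction of $N$; essentially all of the atoms are left for the ground orbital.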


Orbital Occupancy Versus Temperature 


We saw in (19) that the number of free particle orbitals per unit energy range is

$$\mathcal{D}(\varepsilon) = \frac{V}{4\pi^2}\left(\frac{2M}{\hbar^2}\right)^{3/2}\varepsilon^{1/2} \tag{65}$$

for a particle of spin zero. The total number of atoms of helium-4 in the ground
and excited orbitals is given by the sum of the occupancies of all orbitals:

$$N = \sum_n f_n = N_0(\tau) + N_e(\tau) = N_0(\tau) + \int_0^\infty d\varepsilon\, \mathcal{D}(\varepsilon) f(\varepsilon,\tau). \tag{66}$$

We have separated the sum over $n$ into two parts. Here $N_0(\tau)$ has been written
for $f(0,\tau)$, the number of atoms in the ground orbital at temperature $\tau$. The
integral in (66) gives the number of atoms $N_e(\tau)$ in all excited orbitals, with
$f(\varepsilon,\tau)$ as the Bose-Einstein distribution function. The integral gives only the
number of atoms in excited orbitals and excludes the atoms in the ground
orbital, because the function $\mathcal{D}(\varepsilon)$ is zero at $\varepsilon = 0$. To count the atoms correctly
we must count separately the occupancy $N_0$ of the orbital with $\varepsilon = 0$. Although
only a single orbital is involved, the value of $N_0$ may be very large in a gas of
bosons. We shall call $N_0$ the number of atoms in the condensed phase and $N_e$
the number of atoms in the normal phase. The whole secret of the result which
follows is that at low temperatures the chemical potential $\mu$ is very much closer
in energy to the ground state orbital than the first excited orbital is to the
ground state orbital. This closeness of $\mu$ to the ground orbital loads most of
the population of the system into the ground orbital (Figure 7.10).

Figure 7.10 Plot of the boson distribution function $f(\varepsilon,\tau)$ for two temperatures, with
sufficient particles present to ensure $\lambda \approx 1$. The integral of the distribution times the
density of states gives the number $N_e$ of particles in excited orbitals; the rest of the
particles present are condensed into the ground state orbital. The value of $N_0$ is too
large to be shown on the plot.


The Bose-Einstein distribution function when written for the orbital at
$\varepsilon = 0$ is

$$N_0(\tau) = \frac{1}{\lambda^{-1} - 1}, \tag{67}$$

as in (54), where $\lambda$ will depend on the temperature $\tau$. The number of particles
in all excited orbitals increases as $\tau^{3/2}$:

$$N_e(\tau) = \frac{V}{4\pi^2}\left(\frac{2M}{\hbar^2}\right)^{3/2}\int_0^\infty d\varepsilon\, \frac{\varepsilon^{1/2}}{\lambda^{-1}\exp(\varepsilon/\tau) - 1},$$

or, with $x \equiv \varepsilon/\tau$,

$$N_e(\tau) = \frac{V}{4\pi^2}\left(\frac{2M}{\hbar^2}\right)^{3/2}\tau^{3/2}\int_0^\infty dx\, \frac{x^{1/2}}{\lambda^{-1}e^{x} - 1}. \tag{68}$$

Notice the factor $\tau^{3/2}$ which gives the temperature dependence of $N_e$.

At sufficiently low temperatures the number of particles in the ground state
will be a very large number. Equation (67) tells us that $\lambda$ must be very close to
unity whenever $N_0$ is $\gg 1$. Then $\lambda$ is very accurately constant, because a
macroscopic value of $N_0$ forces $\lambda$ to be close to unity. The condition for the validity
of the calculation is that $N_0 \gg 1$, and it is not required that $N_e \ll N$. When
$\varepsilon \geq \tau$ in the integrand, the value of the integrand is insensitive to small
deviations of $\lambda$ from 1, so that we can set $\lambda = 1$ in (68), although not in (67).

The value of the integral* in (68) is, when $\lambda = 1$,

$$\int_0^\infty dx\, \frac{x^{1/2}}{e^{x} - 1} = 1.306\,\pi^{1/2}. \tag{69}$$

* To evaluate the integral we write

$$\int_0^\infty dx\, \frac{x^{1/2}}{e^{x} - 1} = \int_0^\infty dx\, \frac{x^{1/2}e^{-x}}{1 - e^{-x}} = \sum_{s=1}^{\infty}\int_0^\infty dx\, x^{1/2}e^{-sx} = \left(\sum_{s=1}^{\infty} s^{-3/2}\right)\int_0^\infty dy\, y^{1/2}e^{-y}.$$

The infinite sum is easily evaluated numerically to be 2.612. The integral may be transformed with
$y = u^2$ to give

$$2\int_0^\infty du\, u^2\exp(-u^2) = \tfrac{1}{2}\pi^{1/2}.$$
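The two numbers in the footnote, 2.612 and 1.306, can be checked with a short calculation. Because the series $\sum s^{-3/2}$ converges slowly, the sketch below adds the leading Euler-Maclaurin tail terms to a finite partial sum; that acceleration step and the function name are choices made for this illustration, not the method of the text. The remaining integral is the gamma function $\Gamma(3/2) = \tfrac{1}{2}\pi^{1/2}$.

```python
import math

def zeta_three_halves(terms=1000):
    """Sum of s^(-3/2) over s >= 1: finite partial sum plus an estimated tail."""
    partial = sum(s ** -1.5 for s in range(1, terms + 1))
    # Tail beyond `terms`: integral term 2/sqrt(n) minus half of the last summand.
    tail = 2.0 / math.sqrt(terms) - 0.5 * terms ** -1.5
    return partial + tail

series_sum = zeta_three_halves()
gamma_integral = math.gamma(1.5)            # integral of y^(1/2) e^(-y) from 0 to infinity
integral_69 = series_sum * gamma_integral   # left-hand side of Eq. (69)

print(f"sum of s^(-3/2)       = {series_sum:.4f}")
print(f"integral / sqrt(pi)   = {integral_69 / math.sqrt(math.pi):.4f}")
```

The second printed value is the coefficient of $\pi^{1/2}$ in (69), that is, half of the series sum.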


Thus the number of atoms in excited states is

$$N_e = \frac{1.306\,V}{4}\left(\frac{2M\tau}{\pi\hbar^2}\right)^{3/2} = 2.612\,n_Q V, \tag{70}$$

where $n_Q \equiv (M\tau/2\pi\hbar^2)^{3/2}$ is again the quantum concentration. We divide $N_e$ by
$N$ to obtain the fraction of atoms in excited orbitals:

$$N_e/N = 2.612\,n_Q V/N = 2.612\,n_Q/n. \tag{71}$$

The value $\lambda \approx 1$ or $1 - 1/N$ which led to (71) is valid as long as a large
number of atoms are in the ground state. All particles have to be in some
orbital, either in an excited orbital or in the ground orbital. The number in
excited orbitals is relatively insensitive to small changes in $\lambda$, but the rest of
the particles have to be in the ground orbital. To assure this we must take $\lambda$
very close to 1 as long as $N_0$ is a large number. Even $10^3$ is a large number for
the occupancy of an orbital. Yet within $\Delta\tau/\tau_E = 10^{-6}$ of the transition, where
$\tau_E$ is defined by (72) below, the occupancy of the ground orbital is $> 10^{15}$ atoms
cm$^{-3}$ at the concentration of liquid $^4$He. Thus our argument is highly accurate
at $\Delta\tau/\tau_E = 10^{-6}$.


Einstein Condensation Temperature 


We define the Einstein condensation temperature* $\tau_E$ as the temperature for
which the number of atoms in excited states is equal to the total number of
atoms. That is, $N_e(\tau_E) = N$. Above $\tau_E$ the occupancy of the ground orbital is
not a macroscopic number; below $\tau_E$ the occupancy is macroscopic. From (70)
with $N$ for $N_e$ we find for the condensation temperature

$$\tau_E = \frac{2\pi\hbar^2}{M}\left(\frac{N}{2.612\,V}\right)^{2/3}. \tag{72}$$

Now (71) may be written as

$$N_e/N \approx (\tau/\tau_E)^{3/2}, \tag{73}$$

where $N$ is the total number of atoms. The number of atoms in excited orbitals
varies as $\tau^{3/2}$ at temperatures below $\tau_E$, as shown in Figure 7.11. The calculated
value of $T_E$ for atoms of $^4$He is $\approx 3$ K.

* A. Einstein, Akademie der Wissenschaften, Berlin, Sitzungsberichte 1924, 261; 1925, 3.

Figure 7.11 Condensed boson gas: temperature dependence
of the proportion $N_0/N$ of atoms in the ground orbital and
of the proportion $N_e/N$ of atoms in all excited orbitals, plotted against $\tau/\tau_E$. We
have labeled the two components as normal and superfluid
to agree with the customary description of liquid helium.
The slopes of all three curves are intended to be zero at $\tau = 0$.

The number of particles in the ground orbital is found from (73):

$$N_0 = N - N_e = N[1 - (\tau/\tau_E)^{3/2}]. \tag{74}$$

We note that $N$ may be of the order of $10^{22}$. For $\tau$ even slightly less than $\tau_E$ a
large number of particles will be in the ground orbital, as we see in Figure 7.11.
We have said that the particles in the ground orbital below $\tau_E$ form the condensed
phase or the superfluid phase.

The condensation temperature in kelvin is given by the numerical relation

$$T_E\ (\text{in K}) = 115/(V_M^{2/3} M), \tag{75}$$

where $V_M$ is the molar volume in cm$^3$ mol$^{-1}$ and $M$ is the molecular weight.
For liquid helium $V_M = 27.6$ cm$^3$ mol$^{-1}$ and $M = 4$; thus $T_E = 3.1$ K.
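The condensation temperature can be evaluated from (72) in CGS units and compared with the numerical relation (75); the condensed fraction then follows from (74). In the sketch below the physical constants are rounded standard values, and the function names and the sample temperatures are assumptions for the illustration.

```python
import math

HBAR = 1.0546e-27   # erg s
K_B = 1.3807e-16    # erg K^-1
N_AVOGADRO = 6.022e23
AMU = 1.6605e-24    # g

def einstein_temperature_k(molar_volume_cm3, molecular_weight):
    """Eq. (72) converted to kelvin: T_E = (2 pi hbar^2 / M k_B) (n / 2.612)^(2/3)."""
    concentration = N_AVOGADRO / molar_volume_cm3   # atoms per cm^3
    mass = molecular_weight * AMU                   # g per atom
    tau_e = 2.0 * math.pi * HBAR ** 2 / mass * (concentration / 2.612) ** (2.0 / 3.0)
    return tau_e / K_B

def condensed_fraction(temperature_k, t_einstein_k):
    """Eq. (74): N_0/N = 1 - (T/T_E)^(3/2) below T_E, and zero above."""
    if temperature_k >= t_einstein_k:
        return 0.0
    return 1.0 - (temperature_k / t_einstein_k) ** 1.5

t_e = einstein_temperature_k(27.6, 4.0)
t_e_numerical = 115.0 / (27.6 ** (2.0 / 3.0) * 4.0)   # Eq. (75)

print(f"T_E from (72): {t_e:.2f} K;  from (75): {t_e_numerical:.2f} K")
for temperature in (0.5, 1.0, 2.0, 3.0):
    print(f"T = {temperature:.1f} K: N_0/N = {condensed_fraction(temperature, t_e):.3f}")
```

The two routes agree to within the rounding of the constants, and the condensed fraction rises steeply as the temperature drops below $T_E$.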


Liquid $^4$He


The calculated temperature of 3 K is suggestively close to the actual temperature
of 2.17 K at which a transition to a new state of matter is observed to take place
in liquid helium (Figure 7.12). We believe that in liquid $^4$He below 2.17 K there
is a condensation of a substantial fraction of the atoms of $^4$He into the ground
orbital of the system. This is different from the condensation in coordinate
space that occurs in the condensation of a gas to a liquid. Evidently the
interatomic forces that lead to the liquefaction of $^4$He at 4.2 K under a pressure of
one atmosphere are too weak to destroy the major effects of the boson
condensation at 2.17 K. In this respect the liquid behaves as a gas. The condensation
into the ground orbital is certainly connected with the properties of bosons.
The condensation is normally not permitted for fermions, but pairs of fermions
may act as bosons, as in the superconductivity of electron pairs (Cooper pairs)
in metals. A different type of transition to complex phases with superfluid
properties has been observed in liquid $^3$He below 3 mK. Atoms of $^3$He have
spin $\tfrac{1}{2}$ and are fermions, but pairs of $^3$He atoms act as bosons.

Figure 7.12 Heat capacity of liquid $^4$He. The sharp
peak near 2.17 K is evidence of an important transition
in the nature of the liquid. The viscosity of the liquid
above the transition temperature is typical of normal
liquids, whereas the viscosity below the transition as
determined by rate of flow through narrow slits is
vanishingly small, at least $10^6$ times smaller than the
viscosity above the transition. The transition is often
called a lambda transition merely because of the shape
of the graph. After Keesom et al.

We can give several arguments in support of our view of liquid helium as a
gas of noninteracting particles. At first sight this is a drastic oversimplification
of the problem, but there are some important features of liquid helium for
which the view is correct.

(a) The molar volume of liquid $^4$He at absolute zero is 3.1 times the volume
that we calculate from the known interactions of helium atoms. The interaction
forces between pairs of helium atoms are well known experimentally and
theoretically, and from these forces by standard elementary methods of solid
state physics we can calculate the equilibrium volume of a static lattice of
helium atoms. In a typical calculation we find the molar volume to be 9 cm$^3$
mol$^{-1}$, as compared with the observed 27.6 cm$^3$ mol$^{-1}$. Thus the kinetic motion
of the helium atoms has a large effect on the liquid state and leads to an
expanded structure in which the atoms to a certain extent can move freely over
appreciable distances. We can say that the quantum zero-point motion is
responsible for the expansion of the molar volume.

(b) The transport properties of liquid helium in the normal state are not very
different from those of a normal classical gas. In particular, the ratio of the
thermal conductivity $K$ to the product of the viscosity $\eta$ times the heat capacity
per unit mass has the values


K i at 28K 


nCy 32, at 4.0K 


These values are quite close to those observed for normal gases at room
temperature; see Table 14.3. The values of the transport coefficients
themselves in the liquid are within an order of magnitude of those calculated for the
gas at the same density. Normal liquids act quite differently.

(c) The forces in the liquid are relatively weak, and the liquid does not exist
above the critical temperature of 5.2 K, which is the maximum boiling point
observed. The binding energy would be perhaps ten times stronger in the
equilibrium configuration of a static lattice, but the expansion of the molar
volume by the quantum zero-point motion of the atoms is responsible for the
reduction in the binding energy to the observed value. The value of the critical
temperature is directly proportional to the binding energy.

(d) The liquid is stable at absolute zero at pressures under 25 atm; above
25 atm the solid is more stable.

Figure 7.13 Comparison of rates of flow of liquid $^3$He and
liquid $^4$He under gravity through a fine hole. Notice the sudden
onset of high fluidity or superfluidity in $^4$He. Vertical axis: rate of mass flow,
in units of $10^{-8}$ g s$^{-1}$. After D. W.
Osborne, B. Weinstock, and B. M. Abraham, Phys. Rev. 75,
988 (1949).


The new state of matter into which liquid $^4$He enters when cooled below
2.17 K has quite astonishing properties. The viscosity as measured in a flow
experiment* is essentially zero (Figure 7.13), and the thermal conductivity is
very high. We say that liquid $^4$He below the transition temperature is a
superfluid. More precisely, we denote liquid $^4$He below the transition temperature
as liquid He II, and we say that liquid He II is a mixture of normal fluid and
superfluid components. The normal fluid component consists of the helium
atoms in thermally excited orbitals, and the superfluid component consists of
the helium atoms condensed into the ground orbital. It is known that the
radioactive boson $^6$He in solution in liquid $^4$He does not take part in the
superflow of the latter; neither, of course, does the fermion $^3$He in solution in
$^4$He take part in the superflow.

We speak of liquid ⁴He above the transition temperature as liquid He I. 
There is no superfluid component in liquid He I, for here the ground orbital 
occupancy is negligible, being of the same order of magnitude as the occupancy 
of any other low-lying orbital, as we have seen. The regions of pressure and 
temperature in which liquid He I and II exist are shown in Figure 7.14. 


* In other arrangements there may be an effective viscosity: this is true of a disk oscillating in liquid 
⁴He at any finite temperature below the condensation temperature. For a combination of two 
fluids of different viscosities, some experiments measure the average viscosity, and other experiments 
measure the average of $1/\eta$, or the average fluidity. 


Figure 7.14 The melting curve of liquid and 
solid helium (⁴He), and the transition curve 
between the two forms of liquid helium, He I 
and He II. The liquid He II form exhibits 
superflow properties as a consequence of the 
condensation of atoms into the ground orbital 
of the system. Note that helium is a liquid at 
absolute zero at pressures below 25 atm. The 
liquid-vapor boiling curve is not included in 
this graph as it would merge with the zero 
pressure line. After C. A. Swenson, Phys. Rev. 
79, 626 (1950). 

The development of superfluid properties is not an automatic consequence 
of the Einstein condensation of atoms into the ground orbital. Advanced cal- 
culations show that it is the existence of some form (almost any form) of inter- 
action among atoms that leads to the development of superfluid properties in 
the atoms condensed in the ground orbital. 


Phase Relations of Helium 


The phase diagram of ⁴He was shown in Figure 7.14. The liquid-vapor curve 
can be followed from the critical point of 5.2 K down to absolute zero without 
any appearance of the solid. At the transition temperature the normal liquid, 
called He I, makes a transition to the form with superfluid properties, called 
He II. A temperature called the λ point is the triple point at which liquid He I, 
liquid He II, and vapor coexist. Keesom, who first solidified helium, found that 
the solid* did not exist below a pressure of 25 atm. Another triple point exists 
at 1.743 K: here the solid is in equilibrium with the two liquid modifications, 
He I and He II. The two triple points are connected by a line that separates the 
regions of existence of He II and He I. 


* An interesting discussion of solid helium is given by B. Bertram and R. A. Guyer, Scientific 
American, August 1967, pp. 85-95. Solid ⁴He exists in three crystal structures according to the 
conditions of temperature and pressure. 


Figure 7.15 Phase diagrams for liquid ³He, (a) in 
kelvin and (b) in millikelvin. In the region of negative 
slope shown in (a) on the phase boundary the solid has 
a higher entropy than the liquid, and we have to add 
heat to the liquid to solidify it. Superfluid properties 
appear in (b) in the A and B phases of liquid ³He. The 
A phase is double: in a magnetic field the phase divides 
into two components with opposite nuclear magnetic 
moments. 

The phase diagram of ³He differs in a remarkable way from the phase diagram 
of ⁴He. Figure 7.15 exhibits the importance of the fermion nature of ³He. Note 
the negative slope of the coexistence curve at low temperatures. As explained 
in Chapter 10, the negative slope means that the entropy of the liquid phase 
is lower than the entropy of the solid phase. 


Quasiparticles and Superfluidity, ⁴He 


For many purposes the superfluid component of liquid helium II behaves as 
if it were a vacuum, as if it were not there at all. The $N_0$ atoms of the superfluid 
are condensed into the ground orbital and have no excitation energy, for the 
ground orbital by definition has no excitation energy. The superfluid has energy 
only when the center of mass of the superfluid is given a velocity relative to the 
laboratory reference frame—as when the superfluid is set into flow relative to 
the laboratory. 

The condensed component of $N_0$ atoms will flow with zero viscosity so long 
as the flow does not create excitations in the superfluid—that is, so long as no 
atoms make transitions between the ground orbital and the excited orbitals. 
Such transitions might be caused by collisions of helium atoms with irregu- 
larities in the wall of the tube through which the helium atoms are flowing. 
The transitions, if they occur, are a cause of energy loss and of momentum 
loss from the moving fluid, and the flow is not resistanceless if such collisions 
can occur. 

The criterion for superfluidity involves the energy and momentum relationship 
of the excitations in liquid He II. If the excited orbitals were really like the 
orbitals of free atoms, with a free particle relation 

$$\varepsilon = \tfrac{1}{2}Mv^2 = \frac{1}{2M}(\hbar k)^2 \tag{76}$$

between the energy $\varepsilon$ and the momentum $Mv$ or $\hbar k$ of an atom, then we can 
show that superfluidity would not be expected. Here $k = 2\pi/\text{wavelength}$. But 
because of the existence of interactions between the atoms the low energy 
excitations do not resemble free particle excitations, but are longitudinal sound 
waves, longitudinal phonons (Chapter 4). After all, it is not unreasonable that a 
longitudinal sound wave should propagate in any liquid, even though we have 
no previous experience of superliquids. 

A language has grown up to describe the low-lying excited states of a system 
of many atoms. These states are called elementary excitations and in their 
particle aspect the states are called quasiparticles. Longitudinal phonons are 
the elementary excitations of liquid He II. We shall give the clear-cut experi- 
mental evidence for this, but first we derive a necessary condition for super- 
fluidity. This condition will show us why the phonon-like nature of the 
elementary excitations leads to the superfluid behavior of liquid He II. 


We consider in Figure 7.16 a body, perhaps a steel ball or a neutron, of 
mass $M_0$ falling with velocity $\mathbf{V}$ down a column of liquid helium at rest at 
absolute zero, so that initially no elementary excitations are excited. If the 
motion of the body generates elementary excitations, there will be a damping 
force on the body. In order to generate an elementary excitation of energy $\varepsilon_k$ 
and momentum $\hbar\mathbf{k}$, we must satisfy the law of conservation of energy: 

$$\tfrac{1}{2}M_0V^2 = \tfrac{1}{2}M_0V'^2 + \varepsilon_k , \tag{77}$$

where $\mathbf{V}'$ is the velocity of the body after creation of the elementary excitation. 
Furthermore, we must satisfy the law of conservation of momentum 

$$M_0\mathbf{V} = M_0\mathbf{V}' + \hbar\mathbf{k} . \tag{78}$$


The two conservation laws cannot always be satisfied at the same time even 
if the direction of the excitation created in the process is unrestricted. To show 
this we rewrite (78) as 

$$M_0\mathbf{V} - \hbar\mathbf{k} = M_0\mathbf{V}'$$

and take the square of both sides: 

$$M_0^2V^2 - 2M_0\hbar\,\mathbf{V}\cdot\mathbf{k} + \hbar^2k^2 = M_0^2V'^2 .$$

On multiplication by $1/2M_0$ we have 

$$\tfrac{1}{2}M_0V^2 - \hbar\,\mathbf{V}\cdot\mathbf{k} + \frac{\hbar^2k^2}{2M_0} = \tfrac{1}{2}M_0V'^2 . \tag{79}$$

We subtract (79) from (77) to obtain 

$$\hbar\,\mathbf{V}\cdot\mathbf{k} - \frac{\hbar^2k^2}{2M_0} = \varepsilon_k . \tag{80}$$


Figure 7.16 Body of mass $M_0$ moving with 
velocity $\mathbf{V}$ down a cylinder that contains liquid helium. 



There is a lowest value of the magnitude of the velocity $\mathbf{V}$ for which this 
equation can be satisfied. The lowest value will occur when the direction of $\mathbf{k}$ 
is parallel to that of $\mathbf{V}$. This critical velocity is given by 

$$V_c = \text{minimum of } \frac{\varepsilon_k + \hbar^2k^2/2M_0}{\hbar k} . \tag{81}$$

The condition is a little simpler to express if we let the mass $M_0$ of the body 
become very large, for then 

$$V_c = \text{minimum of } \frac{\varepsilon_k}{\hbar k} . \tag{82}$$

A body moving with a lower velocity than $V_c$ will not be able to create excitations 
in the liquid, so that the motion will be resistanceless. The viscosity will appear 
to be zero. A body moving with higher velocity will encounter resistance 
because of the generation of excitations. 

There is a simple geometrical construction for (82). We make a plot of the 
energy $\varepsilon_k$ of an elementary excitation as a function of the momentum $\hbar k$ of the 
excitation. We construct the straight line from the origin which just touches 
the curve from below. The slope of this line is equal to the critical velocity. If 
$\varepsilon_k = \hbar^2k^2/2M$, as for the excitation of a free atom, the straight line has zero 
slope and the critical velocity is zero: 

$$\text{Free atoms:} \quad V_c = \text{minimum of } \hbar k/2M = 0 . \tag{83}$$

The energy of a low energy phonon in liquid He II is $\varepsilon_k = \hbar\omega_k = \hbar v_s k$ in the 
frequency region of sound waves where the product of wavelength and frequency 
is equal to the velocity of sound $v_s$, or where the circular frequency $\omega_k$ is equal 
to the product of $v_s$ times the wavevector $k$. Now the critical velocity is 

$$\text{Phonons:} \quad V_c = \text{minimum of } \hbar v_s k/\hbar k = v_s . \tag{84}$$

The critical velocity $V_c$ is equal to the velocity of sound if (84) is valid for all 
wavevectors, which it is not in liquid helium II. The observed critical flow 
velocities are indeed nonzero, but considerably lower than the velocity of sound 
and usually lower than the solid straight line in Figure 7.17, presumably because 
the plot of $\varepsilon_k$ versus $\hbar k$ may turn downward at very high $\hbar k$. 

The actual spectrum of elementary excitations in liquid helium II has been 
determined by the observations on the inelastic scattering of slow neutrons. 
The experimental results are shown in Figure 7.17. The solid straight line is the 
Landau critical velocity for the range of wavevectors covered by the neutron 
experiments, and for this line the critical velocity is 

$$V_c = \Delta/\hbar k_0 \approx 5 \times 10^3\ \text{cm s}^{-1} , \tag{85}$$

where $\Delta$ and $k_0$ are identified on the figure. 
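
The geometrical construction behind (82) to (85) can be checked numerically. The sketch below evaluates the minimum of $\varepsilon_k/\hbar k$ on a grid of wavevectors for three dispersion relations: free atoms, phonons, and a parabolic model of the minimum in the measured curve. The free-atom and phonon forms and the sound velocity of 237 m s⁻¹ come from the text and the caption of Figure 7.17. The parabolic form and its parameters ($\Delta/k_B \approx 8.65$ K, $k_0 \approx 1.92 \times 10^8$ cm⁻¹, effective mass about 0.16 of the ⁴He atomic mass) are not given in the text; they are approximate values assumed here for illustration only.

```python
import numpy as np

# CGS constants
HBAR = 1.0546e-27    # erg s
K_B = 1.3807e-16     # erg / K
M_HE4 = 6.646e-24    # g, mass of one 4He atom


def critical_velocity(energy, k):
    """Return the minimum of energy / (hbar k) over the sampled k, in cm/s."""
    return float(np.min(energy / (HBAR * k)))


# Wavevector grid in cm^-1; it starts slightly above zero to avoid 0/0.
k = np.linspace(1.0e5, 2.5e8, 20001)

# Free atoms, as in (83): the ratio falls toward zero as k -> 0.
free_atom = (HBAR * k) ** 2 / (2.0 * M_HE4)

# Phonons, as in (84): sound velocity taken from the Figure 7.17 caption.
V_SOUND = 2.37e4     # cm/s
phonon = HBAR * V_SOUND * k

# ASSUMED parabolic model of the minimum in the measured curve.
DELTA = 8.65 * K_B   # erg, assumed energy at the minimum
K0 = 1.92e8          # cm^-1, assumed wavevector at the minimum
MU = 0.16 * M_HE4    # g, assumed effective mass near the minimum
model_minimum = DELTA + (HBAR * (k - K0)) ** 2 / (2.0 * MU)

v_free = critical_velocity(free_atom, k)
v_phonon = critical_velocity(phonon, k)
v_model = critical_velocity(model_minimum, k)

print(f"free atoms : {v_free:10.2f} cm/s (tends to 0 as the grid approaches k = 0)")
print(f"phonons    : {v_phonon:10.2f} cm/s")
print(f"model curve: {v_model:10.2f} cm/s; Delta/(hbar k0) = {DELTA / (HBAR * K0):.2f} cm/s")
```

With these assumed parameters the model curve gives a critical velocity of a few times 10³ cm s⁻¹, the same order as the value quoted in (85), while the free-atom result shrinks toward zero as the lower end of the grid is moved toward $k = 0$.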

Charged ions of helium in solution in liquid helium II under certain 
experimental conditions of pressure and temperature have been observed* to move 
almost like free particles and to have a limiting drift velocity near $5 \times 10^3$ cm s⁻¹, 
closely equal to the calculated value of (85). Under other experimental 
conditions the motion of the ions is limited at a lower velocity by the creation of 
vortex rings. Such vortex rings are transverse modes of motion and do not 
appear in the longitudinal modes covered by Figure 7.17. 

Our result (84) for a necessary condition for the critical velocity is more 
general than the calculation we have given. Our calculation demonstrates that 
a body will move without resistance through liquid He II at absolute zero if 
the velocity $V$ of the body is less than the critical velocity $V_c$. However, at 
temperatures above absolute zero, but below the Einstein temperature, there 
will be a normal fluid component of elementary excitations that are thermally 
excited. The normal fluid component is the source of resistance to the motion 
of the body. The superflow aspect appears first in experiments in which the 
liquid flows out through a fine tube in the side of a container. The normal 
fluid component may remain behind in the container while the superfluid 
component leaks out without resistance. The derivation we have given of the 
critical velocity also holds for this situation, with $V$ as the velocity of the 
superfluid relative to the walls of the tube; $M_0$ is the mass of the fluid. Excitations 
would be created above $V_c$ by the interaction between the flow of the liquid 
and any mechanical irregularity in the walls. 


* L. Meyer and F. Reif, Phys. Rev. 123, 727 (1961); G. W. Rayfield, Phys. Rev. Letters 16, 934 (1966). 


Figure 7.17 Energy $\varepsilon_k$ versus wavevector $k$ of elementary 
excitations in liquid helium at 1.12 K. The vertical axis is the energy, in units 
of $1.38 \times 10^{-16}$ erg; the horizontal axis is the wavevector, in units of $10^8$ cm⁻¹. 
The parabolic curve rising 
from the origin represents the theoretically calculated curve for 
free helium atoms at absolute zero. The open circles correspond to 
the energy and momentum of the measured excitations. A smooth 
curve has been drawn through the points. The broken curve 
rising linearly from the origin is the theoretical phonon branch 
with a velocity of sound of 237 m s⁻¹. The solid straight line gives 
the critical velocity, in appropriate units: The line gives the 
minimum of $\varepsilon_k/k$ over the region of $k$ covered in these experiments. After 
D. G. Henshaw and A. D. B. Woods, Phys. Rev. 121, 1266 (1961). 


Superfluid Phases of ³He 


Three superfluid phases of liquid ³He are known* (Figure 7.15b), but, in 
contrast to liquid ⁴He, with transition temperatures of only a few millikelvin. 
The superfluid phases are believed to be qualitatively similar to the 
superconducting state of electrons in metals, where pairs of particles in orbitals near 
the Fermi surface form a type of bound state known as a Cooper pair. Such a 
pair is qualitatively like a diatomic molecule, but the radius of the molecule 
is much larger than the average interelectron spacing in a metal or the average 
interparticle spacing in liquid ³He. 

In metallic superconductivity the two electrons that form a Cooper pair are 
in a nonmagnetic (singlet) spin state. In the superfluid states of liquid ³He the 
two atoms that form a pair are in the triplet spin states of the two ³He nuclei, 
so that three magnetic superfluids are possible, corresponding to spin 
orientations $M_s = 1$, $0$, and $-1$, or mixtures of these three states. The magnetic 
superfluids have been explored experimentally, and both the magnetic and 
superfluid properties have been confirmed. 


* For elementary reviews, see J. C. Wheatley, Physics Today, February 1976, p. 32; A. J. Leggett, 
Physics Bulletin 25, 311 (1975); and J. R. Hook, Physics Bulletin 29, 513 (1978). For deeper reviews, 
see J. C. Wheatley, Rev. Mod. Phys. 47, 415 (1975) and A. J. Leggett, Rev. Mod. Phys. 47, 331 (1975). 


SUMMARY 


1. Compared to a classical gas, a Fermi gas at low temperature has high kinetic 
energy, high pressure, and low heat capacity. The entropy of the Fermi gas 
is zero in the ground state. The energy of the highest filled orbital in the 
ground state of a free particle gas of fermions of spin ½ is 

$$\varepsilon_F = \frac{\hbar^2}{2m}\left(3\pi^2 n\right)^{2/3} ,$$

where $n = N/V$ is the concentration. 


2. The total kinetic energy in the ground state is 

$$U_0 = \tfrac{3}{5}N\varepsilon_F .$$


3. The density of orbitals at $\varepsilon_F$ is 

$$D(\varepsilon_F) = 3N/2\varepsilon_F .$$


4. The heat capacity of an electron gas at $\tau \ll \tau_F$ is 

$$C_{el} = \tfrac{1}{3}\pi^2 D(\varepsilon_F)\tau = \tfrac{1}{2}\pi^2 N\tau/\tau_F ,$$

in fundamental units. 


5. For a Bose gas at $\tau < \tau_E$ the fraction of atoms in excited orbitals is 

$$N_e/N = 2.612\,n_Q/n = (\tau/\tau_E)^{3/2} .$$


6. The Einstein condensation temperature of a gas of noninteracting bosons is 

$$\tau_E = \frac{2\pi\hbar^2}{M}\left(\frac{N}{2.612\,V}\right)^{2/3} .$$

PROBLEMS 


1. Density of orbitals in one and two dimensions. (a) Show that the density 
of orbitals of a free electron in one dimension is 

$$D_1(\varepsilon) = (L/\pi)(2m/\hbar^2\varepsilon)^{1/2} , \tag{86}$$

where $L$ is the length of the line. (b) Show that in two dimensions, for a square 
of area $A$, 

$$D_2(\varepsilon) = Am/\pi\hbar^2 , \tag{87}$$

independent of $\varepsilon$. 


2. Energy of relativistic Fermi gas. For electrons with an energy $\varepsilon \gg mc^2$, 
where $m$ is the rest mass of the electron, the energy is given by $\varepsilon \approx pc$, where 
$p$ is the momentum. For electrons in a cube of volume $V = L^3$ the momentum 
is of the form $(\pi\hbar/L)$, multiplied by $(n_x^2 + n_y^2 + n_z^2)^{1/2}$, exactly as for the 
nonrelativistic limit. (a) Show that in this extreme relativistic limit the Fermi 
energy of a gas of $N$ electrons is given by 

$$\varepsilon_F = \hbar\pi c\,(3n/\pi)^{1/3} , \tag{88}$$

where $n = N/V$. (b) Show that the total energy of the ground state of the gas is 

$$U_0 = \tfrac{3}{4}N\varepsilon_F . \tag{89}$$

The general problem is treated by F. Jüttner, Zeitschrift für Physik 47, 542 (1928). 


3. Pressure and entropy of degenerate Fermi gas. (a) Show that a Fermi 
electron gas in the ground state exerts a pressure 

$$p = \frac{(3\pi^2)^{2/3}}{5}\,\frac{\hbar^2}{m}\left(\frac{N}{V}\right)^{5/3} . \tag{90}$$

In a uniform decrease of the volume of a cube every orbital has its energy 
raised: The energy of an orbital is proportional to $1/L^2$ or to $1/V^{2/3}$. (b) Find 
an expression for the entropy of a Fermi electron gas in the region $\tau \ll \varepsilon_F$. 
Notice that $\sigma \to 0$ as $\tau \to 0$. 


4. Chemical potential versus temperature. Explain graphically why the initial 
curvature of $\mu$ versus $\tau$ is upward for a fermion gas in one dimension and 
downward in three dimensions (Figure 7.7). Hint: The $D_1(\varepsilon)$ and $D_3(\varepsilon)$ curves 
are different, where $D_1$ is given in Problem 1. It will be found useful to set up 
the integral for $N$, the number of particles, and to consider from the graphs 
the behavior of the integrand between zero temperature and a finite temperature. 


5. Liquid ³He as a Fermi gas. The atom ³He has spin $I = \tfrac{1}{2}$ and is a fermion. 
(a) Calculate as in Table 7.1 the Fermi sphere parameters $v_F$, $\varepsilon_F$, and $T_F$ for 
³He at absolute zero, viewed as a gas of noninteracting fermions. The density 
of the liquid is 0.081 g cm⁻³. (b) Calculate the heat capacity at low temperatures 
$T \ll T_F$ and compare with the experimental value $C_V = 2.89Nk_BT$ as observed 
for $T < 0.1$ K by A. C. Anderson, W. Reese, and J. C. Wheatley, Phys. Rev. 
130, 495 (1963); see also Figure 7.18. Excellent surveys of the properties of 
liquid ³He are given by J. Wilks, Properties of liquid and solid helium, Oxford, 
1967, and by J. C. Wheatley, “Dilute solutions of ³He in ⁴He at low 
temperatures,” American Journal of Physics 36, 181-210 (1968). The principles 
of refrigerators based on ³He-⁴He mixtures are reviewed in Chapter 12 on 
cryogenics; such refrigerators produce steady temperatures down to 0.01 K 
in continuously acting operation. 


6. Mass-radius relationship for white dwarfs. Consider a white dwarf of mass 
$M$ and radius $R$. Let the electrons be degenerate but nonrelativistic; the protons 
are nondegenerate. (a) Show that the order of magnitude of the gravitational 
self-energy is $-GM^2/R$, where $G$ is the gravitational constant. (If the mass 
density is constant within the sphere of radius $R$, the exact potential energy is 
$-3GM^2/5R$.) (b) Show that the order of magnitude of the kinetic energy of the 
electrons in the ground state is 

$$\frac{\hbar^2N^{5/3}}{mR^2} \approx \frac{\hbar^2M^{5/3}}{mM_H^{5/3}R^2} ,$$

where $m$ is the mass of an electron and $M_H$ is the mass of a proton. (c) Show 
that if the gravitational and kinetic energies are of the same order of magnitude 
(as required by the virial theorem of mechanics), $M^{1/3}R \approx 10^{20}$ g$^{1/3}$ cm. (d) If the 
mass is equal to that of the Sun ($2 \times 10^{33}$ g), what is the density of the white 
dwarf? (e) It is believed that pulsars are stars composed of a cold degenerate 
gas of neutrons. Show that for a neutron star $M^{1/3}R \approx 10^{17}$ g$^{1/3}$ cm. What is 
the value of the radius for a neutron star with a mass equal to that of the Sun? 
Express the result in km. 


Figure 7.18 Heat capacity of liquid ³He and of a 5 percent solution of 
³He in liquid ⁴He. The quantity plotted on the vertical axis is $C/T$ (relative scale), and 
the horizontal axis is $T$, in K. Thus for a Fermi gas in the degenerate temperature 
region the theoretical curves of $C/T$ at constant volume are horizontal. 
The curve for pure ³He is taken at constant pressure, which accounts for 
the slight slope. The curve for the solution of ³He in liquid ⁴He indicates 
that the ³He in solution acts as a Fermi gas; the degenerate region at low 
temperature goes over to the nondegenerate region at higher temperature. 
The solid line through the experimental points for the solution is drawn 
for $T_F = 0.331$ K, which agrees with the calculation for free atoms if the 
effective mass is taken as 2.38 times the mass of an atom of ³He. Curves 
after J. C. Wheatley, Amer. J. Physics 36 (1968). 


7. Photon condensation. Consider a science fiction universe in which the 
number of photons $N$ is constant, at a concentration of $10^{20}$ cm⁻³. The number 
of thermally excited photons we assume is given by the result of Problem 4.1, 
which is $N_e = 2.404V\tau^3/\pi^2\hbar^3c^3$. Find the critical temperature in K below which 
$N_e < N$. The excess $N - N_e$ will be in the photon mode of lowest frequency; 
the excess might be described as a photon condensate in which there is a large 
concentration of photons in the lowest mode. In reality there is no such principle 
that the total number of photons be constant, hence there is no photon 
condensate. 


8. Energy, heat capacity, and entropy of degenerate boson gas. Find 
expressions as a function of temperature in the region $\tau < \tau_E$ for the energy, heat 
capacity, and entropy of a gas of $N$ noninteracting bosons of spin zero confined 
to a volume $V$. Put the definite integral in dimensionless form; it need not be 
evaluated. The calculated heat capacity above and below $\tau_E$ is shown in 
Figure 7.19. The experimental curve was shown in Figure 7.12. The difference 
between the two curves is marked: It is ascribed to the effect of interactions 
between the atoms. 


9. Boson gas in one dimension. Calculate the integral for $N_e(\tau)$ for a 
one-dimensional gas of noninteracting bosons, and show that the integral does 
not converge. This result suggests that a boson ground state condensate does 
not form in one dimension. Take $\lambda = 1$ for the calculation. (The problem should 
really be treated by means of a sum over orbitals on a finite line.) 


10. Relativistic white dwarf stars. Consider a Fermi gas of $N$ electrons each 
of rest mass $m$ in a sphere of radius $R$. Conditions in certain white dwarfs are 
such that the great majority of electrons have extreme relativistic kinetic 
energies $\varepsilon \approx pc$, where $p$ is the momentum. The de Broglie relation remains 
$\lambda = 2\pi\hbar/p$. Problem 2 gives the ground state kinetic energy of the $N$ electrons 
on the assumption that $\varepsilon = pc$ for all electrons. Treat the sphere as a cube 
of equal volume. (a) Use the standard virial theorem argument to predict the 
value of $N$. Assume that the whole star is ionized hydrogen, but neglect the 
kinetic energy of the protons compared to that of the electrons. (b) Estimate 
the value of $N$. A careful treatment by Chandrasekhar leads not to a single 
value of $N$, but to a limit above which a stable white dwarf cannot exist: see 
D. D. Clayton, Principles of stellar evolution and nucleosynthesis, McGraw-Hill, 
1968, p. 161; M. Harwit, Astrophysical concepts, Wiley, 1973. 


11. Fluctuations in a Fermi gas. Show for a single orbital of a fermion system 
that 

$$\langle(\Delta N)^2\rangle = \langle N\rangle(1 - \langle N\rangle) , \tag{91}$$

if $\langle N\rangle$ is the average number of fermions in that orbital. Notice that the 
fluctuation vanishes for orbitals with energies deep enough below the Fermi 
energy so that $\langle N\rangle = 1$. By definition, $\Delta N = N - \langle N\rangle$. 


12. Fluctuations in a Bose gas. If $\langle N\rangle$ as in (11) is the average occupancy 
of a single orbital of a boson system, then from (5.83) show that 

$$\langle(\Delta N)^2\rangle = \langle N\rangle(1 + \langle N\rangle) . \tag{92}$$

Thus if the occupancy is large, with $\langle N\rangle \gg 1$, the fractional fluctuations are 
of the order of unity: $\langle(\Delta N)^2\rangle/\langle N\rangle^2 \approx 1$, so that the actual fluctuations can 
be enormous. It has been said that “bosons travel in flocks.” The first edition 
of this text has an elementary discussion of the fluctuations of photons. 


13. Chemical potential versus concentration. (a) Sketch carefully the chemical 
potential versus the number of particles for a boson gas in volume $V$ at 
temperature $\tau$. Include both classical and quantum regimes. (b) Do the same 
for a system of fermions. 


14. Two orbital boson system. Consider a system of $N$ bosons of spin zero, 
with orbitals at the single particle energies $0$ and $\varepsilon$. The chemical potential is 
$\mu$, and the temperature is $\tau$. Find $\tau$ such that the thermal average population 
of the lowest orbital is twice the population of the orbital at $\varepsilon$. Assume $N \gg 1$ 
and make what approximations are reasonable. 


ENERGY AND ENTROPY TRANSFER: 
DEFINITION OF HEAT AND WORK 


HEAT ENGINES: CONVERSION 
OF HEAT INTO WORK 

Carnot Inequality 

Sources of Irreversibility 

Refrigerators 

Air Conditioners and Heat Pumps 

Carnot Cycle 

Example: Carnot Cycle for an Ideal Gas 

Energy Conversion and the Second Law of Thermodynamics 
Path Dependence of Heat and Work 

Irreversible Work 

Example: Sudden Expansion of an Ideal Gas 


HEAT AND WORK AT CONSTANT 
TEMPERATURE OR CONSTANT PRESSURE 


Isothermal Work 

Isobaric Heat and Work 

Example: Electrolysis and Fuel Cells 
Chemical Work 

Example: Chemical Work for an Ideal Gas 
Magnetic Work and Superconductors 

7. Light Bulb in a Refrigerator 
8. Geothermal Energy 
9. Cooling of Nonmetallic Solid to T = 0 
10. Irreversible Expansion of a Fermi Gas 


Note: In (and only in) the discussion of energy conversion devices that operate in cycles, we shall 
define all energy, entropy, and heat transfers as positive, whether the flow is into or out of the device. 
This convention avoids needless difficulties with algebraic signs. The term “reversible” includes 
processes for which the combined entropy of the interacting systems remains constant. 



ENERGY AND ENTROPY TRANSFER: 
DEFINITION OF HEAT AND WORK 


Heat and work are two different forms of energy transfer. Heat is the transfer 
of energy to a system by thermal contact with a reservoir. Work is the transfer 
of energy to a system by a change in the external parameters that describe the 
system. The parameters may include volume, magnetic field, electric field, or 
gravitational potential. The reason we distinguish heat from work will be clear 
when we discuss energy conversion processes. 

The most important physical process in a modern energy-intensive civiliza- 
tion is the conversion of heat into work. The Industrial Revolution was made 
possible by the steam engine, which converts heat to work. The internal com- 
bustion engine, which seems to dominate man as much as it serves him, is a 
device to convert heat to work. The problem of understanding the limitations 
of the steam engine gave rise to much of the development of thermodynamics. 
Energy conversion remains one of the central applications of thermal physics 
because most electrical energy is generated from heat. 

The fundamental difference between heat and work is the difference in the 
entropy transfer. Consider the energy transfer $dU$ from a reservoir to a system 
with which the reservoir is in thermal contact at temperature $\tau$; an entropy 
transfer $d\sigma = dU/\tau$ accompanies the energy transfer, according to the argument 
of Chapter 2. This energy transfer is what we defined above as heat, and we see 
it is accompanied by entropy transfer. Work, being energy transfer by a change 
in external parameters, such as the position of a piston, does not transfer any 
entropy to the system. There is no place for entropy to come from when only 
work is performed or transferred. 

However, we must be careful: the total energy of two systems brought into 
contact is conserved, but their total entropy is not necessarily conserved and 
may increase. The entropy transfer between two systems in thermal contact is 
well defined only if the entropy of one system increases by as much as the entropy 
of the other decreases. Let us restrict ourselves for the present to reversible 
processes such that the combined entropy of the interacting systems remains 
constant. Later we will generalize the discussion to irreversible processes which 
are processes in which the total entropy of the two systems increases, as in the 
heat flow example in Chapter 2. 



Chapter 8: Heat and Work 


We can give a quantitative expression to the distinction between heat and 
work. Let $dU$ be the energy change of a system during a reversible process; $d\sigma$ 
is the entropy change, and $\tau$ is the temperature. We define 

$$\text{đ}Q = \tau\,d\sigma \tag{1}$$

as the heat received by the system in the process. By the principle of conservation 
of energy, 

$$dU = \text{đ}W + \text{đ}Q , \tag{2}$$

which says that the energy change is caused partly by work done on the system 
and partly by heat added to the system from the reservoir. Then 

$$\text{đ}W = dU - \text{đ}Q = dU - \tau\,d\sigma \tag{3}$$

is the work performed on the system in the reversible process. Our reasons for 
designating heat and work by đQ and đW rather than dQ and dW are explained 
below. For $d\sigma = 0$, we have pure work; for $dU = \tau\,d\sigma$, pure heat. 
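
To make the bookkeeping of (1) to (3) concrete, here is a minimal sketch for one particular reversible process: the isothermal expansion of an ideal gas. The choice of this process, the function name, and the use of the ideal gas results (energy that depends only on temperature, and an entropy change $N\ln(V_2/V_1)$ at fixed $\tau$) are assumptions made for illustration; they are not part of the definitions above.

```python
import math


def isothermal_expansion(n_atoms, tau, v_initial, v_final):
    """Heat and work for a reversible isothermal ideal-gas expansion (illustrative).

    tau is the fundamental temperature in energy units. Returns (dU, Q, W),
    with Q the heat received by the gas and W the work performed on the gas.
    """
    # Entropy change of an ideal gas at fixed temperature (assumed result).
    d_sigma = n_atoms * math.log(v_final / v_initial)
    heat = tau * d_sigma      # (1) integrated at constant tau
    d_u = 0.0                 # ideal-gas energy depends only on tau
    work = d_u - heat         # (3): W = dU - Q
    return d_u, heat, work


d_u, heat, work = isothermal_expansion(n_atoms=1.0, tau=2.0, v_initial=1.0, v_final=math.e)
print(f"dU = {d_u}, Q = {heat:.3f}, W = {work:.3f}")
```

In this example the gas receives heat and the work performed on the gas is negative, which means the gas does work on its surroundings, while the energy of the gas is unchanged.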


HEAT ENGINES: CONVERSION 
OF HEAT INTO WORK 


Carnot inequality. Heat and work have different roles in energy conversion 
processes because of the difference in entropy transfer. Consider two conse- 
quences of the difference: 

(a) All types of work are freely convertible into mechanical work and into 
each other, because the entropy transfer is zero. An ideal electrical motor, 
without mechanical friction or electrical resistance, is a device to convert 
electrical work into mechanical work. An ideal electrical generator converts 
mechanical work into electrical work. Because all forms of work are freely 
convertible, they are thermodynamically equivalent to each other and, in 
particular, equivalent to mechanical work. The term work denotes all types of 
work. 

(b) Work can be completely converted into heat, but the inverse is not true: 
heat cannot be completely converted into work. Entropy enters the system with 
the heat, but does not leave the system with the work. A device that generates 
work from heat must necessarily strip the entropy from the heat that has been 
converted to work. The entropy removed from the converted input heat cannot 
be permitted to pile up inside the device indefinitely; this entropy must 
ultimately be removed from the device. The only way to do this is to provide more 
input heat than the amount converted to work, and to eject the excess input 
heat as waste heat, at a temperature lower than that of the input heat (Figure 8.1). 
Because $\text{đ}Q/d\sigma = \tau$, the reversible heat transfer accompanying one unit of 
entropy is given by the temperature at which the heat is transferred. It follows 
that only part of the input heat need be ejected at the lower temperature to 
carry away all the entropy of the input heat. Only the difference between input 
and output heat can be converted to work. To prevent the accumulation of 
entropy there must be some output heat; therefore it is impossible to convert 
all the input heat to work! 

Figure 8.1 Entropy and energy flow in any continuously operating 
reversible device generating work from heat. The entropy outflow 
must equal the entropy inflow. 

A prohibition against unlimited entropy accumulation in a device does not 
mean entropy cannot accumulate temporarily, provided that it is ultimately 
removed. Many practical energy-conversion devices operate in cycles, and the 
entropy contained in the device varies periodically with time. Such a cyclic 
device is called a heat engine. The internal combustion engine is an example: 
The entropy contained in each cylinder is at a minimum near the beginning of 
the intake stroke and a maximum near the beginning of the exhaust stroke. 
There is a value of the entropy content to which the device returns cyclically; 
the entropy does not pile up indefinitely. 

What fraction of the input heat $Q_h$, taken in during one cycle at the fixed higher
temperature $\tau_h$, can be converted into work? The input entropy associated with
the input heat is $\sigma_h = Q_h/\tau_h$. To avoid confusing signs, we define in this
discussion all energy, heat, and entropy flows as positive whether the flow is into or
out of the system, rather than following the usual convention according to
which a flow is positive into the system and negative out of the system. If $Q_l$ is
the waste heat leaving the system per cycle at the fixed lower temperature $\tau_l$,
the output entropy per cycle is $\sigma_l = Q_l/\tau_l$. In a reversible process this output
entropy is equal to the input entropy:

$\sigma_l = \sigma_h$, or $Q_l/\tau_l = Q_h/\tau_h$, (4)

so that

$Q_l = (\tau_l/\tau_h) Q_h$. (5)


The work generated during one cycle of a reversible process is the difference 
between the heat added and the waste heat extracted: 


$W = Q_h - Q_l = [1 - (\tau_l/\tau_h)] Q_h = \frac{\tau_h - \tau_l}{\tau_h} Q_h$. (6)


The ratio of the work generated to the heat added in the reversible process is 
called the Carnot efficiency:

$\eta_C \equiv (W/Q_h)_{\rm rev} = \frac{\tau_h - \tau_l}{\tau_h} = \frac{T_h - T_l}{T_h}$. (7)


This quantity is named in honor of Sadi Carnot, who derived it in 1824. It was 
a remarkable feat: the concept of entropy had not yet been invented, and 
Carnot’s derivation preceded by some 15 years the recognition that heat is a 
form of energy. 

The Carnot efficiency is the highest possible value of the energy conversion
efficiency $\eta \equiv W/Q_h$, the output work per unit of input heat, in any cyclic heat
engine that operates between the temperatures $\tau_h$ and $\tau_l$. Actual heat engines
have lower efficiencies because the processes taking place within the device are
not perfectly reversible. Entropy will be generated inside the device by irreversible
processes. The energy-entropy flow diagram is modified as in Figure 8.2.
We now have three inequalities:

$\sigma_l \geq \sigma_h$; (8)

$Q_l \geq Q_h(\tau_l/\tau_h)$; (9)

$W = Q_h - Q_l \leq \frac{\tau_h - \tau_l}{\tau_h} Q_h = \eta_C Q_h$. (10)

Figure 8.2 Entropy and energy flow in a real heat engine containing
irreversibilities that generate new entropy inside the device. The
entropy outflow at the lower temperature is larger than the entropy
inflow at the higher temperature.
The actual energy conversion efficiency $\eta$ obeys the Carnot inequality

$\eta \equiv W/Q_h \leq 1 - (\tau_l/\tau_h) \equiv \eta_C$. (11)

We can have $\eta = \eta_C$ only in the limit of reversible operation of a device that
takes in heat at $\tau_h$ and ejects heat at $\tau_l$.

The Carnot inequality is the basic limitation on any heat engine that operates 
in a cyclic process. The result tells us that it is impossible to convert all input 
heat into work. For a given temperature ratio $\tau_h/\tau_l$ the highest conversion
efficiency is obtained under reversible operation. The limiting efficiency increases
with increasing $\tau_h/\tau_l$, but we attain 100 percent efficiency only when
$\tau_l/\tau_h \to 0$.

The low-temperature waste heat of any heat engine must ultimately be 
ejected into the environment, so that $\tau_l$ cannot be below the environmental
temperature, usually about 300 K. High efficiency requires an input temperature
$T_h$ high compared to 300 K. The usable temperatures in practice are unfortunately
limited by various materials constraints. In power plant steam turbines,
which are expected to operate continuously for years, the upper temperature
is currently limited to about 600 K by problems with the strength and corrosion
of steel. With $T_l = 300$ K and $T_h = 600$ K, the Carnot efficiency is $\eta_C = \frac{1}{2}$, or
50 percent. Losses caused by unavoidable irreversibilities reduce this efficiency
typically to about 40 percent. To obtain higher efficiencies is a problem in
high-temperature metallurgy.
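The steam turbine estimate can be reproduced with a few lines of Python. This is a minimal sketch of Eq. (7); the function name `carnot_efficiency` and the second pair of temperatures are assumptions made only for illustration.

```python
def carnot_efficiency(t_low: float, t_high: float) -> float:
    """Carnot efficiency eta_C = (T_h - T_l) / T_h of Eq. (7).

    Temperatures are absolute (kelvin); only their ratio matters.
    """
    if not 0 < t_low <= t_high:
        raise ValueError("need 0 < t_low <= t_high")
    return (t_high - t_low) / t_high


# Steam turbine figures quoted in the text: T_l = 300 K, T_h = 600 K.
eta_turbine = carnot_efficiency(300.0, 600.0)
print(eta_turbine)  # 0.5

# Hypothetical higher inlet temperature, to show how the limit rises.
eta_hot = carnot_efficiency(300.0, 900.0)
print(eta_hot)
```

The limit depends only on the ratio of the two temperatures, which is why a lower exhaust temperature would help as much as a higher inlet temperature if the environment allowed it.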


Sources of irreversibility. Figure 8.3 illustrates several common sources of 
irreversibility: 


(a) Part of the input heat $Q_h$ may flow directly to the low temperature, bypassing
the actual energy conversion process, as in the heat flow into the
cylinder walls during the combustion cycle of the internal combustion
engine.

(b) Part of the temperature difference $\tau_h - \tau_l$ may not be available as
temperature difference in the actual energy conversion process, because of
the temperature drop across thermal resistances in the path of the heat
flow.


(c) Part of the work generated may be converted back to heat by mechanical 
friction. 

(d) Gas may expand irreversibly without doing work, as in the irreversible 
expansion of an ideal gas into a vacuum. 


Figure 8.3 Four sources of irreversibility in heat engines: heat flow
bypassing the energy conversion process, thermal resistance in the path
of the heat flow, frictional losses, and entropy generation during
irreversible expansions. (Diagram labels: thermal resistance; irreversible
expansion without work or heat; friction loss; net work.)


Refrigerators 


Refrigerators are heat engines in reverse. Refrigerators consume work to move
heat from a low temperature $\tau_l$ to a higher temperature $\tau_h$. Consider the energy-entropy
flow diagram of a reversible heat engine in Figure 8.1. Because no
entropy is generated inside the device, its operation can be reversed, with an 
exact reversal of the energy and entropy flows. Equations (4) through (6) 
remain valid for the reversed flows. 
The energy ratio of interest in a refrigerator is not the energy conversion
efficiency (7), but the ratio $\gamma \equiv Q_l/W$ of the heat extracted at the low
temperature to the work consumed. This ratio is called the coefficient of refrigerator
performance; its limiting value in reversible operation is called the Carnot
coefficient of refrigerator performance, denoted by $\gamma_C$. Do not confuse $\gamma \equiv Q_l/W$
with $\eta \equiv W/Q_h$ for the energy conversion efficiency of a heat engine; although
$\eta < 1$ always, $\gamma$ can be $> 1$ or $< 1$. From Eq. (5) and $W = Q_h - Q_l$, the work
consumed is

$W = Q_h - Q_l = \frac{\tau_h - \tau_l}{\tau_l} Q_l$, (12)

so that the Carnot coefficient of refrigerator performance is

$\gamma_C \equiv (Q_l/W)_{\rm rev} = \frac{\tau_l}{\tau_h - \tau_l} = \frac{T_l}{T_h - T_l}$. (13)

This ratio can be larger or smaller than unity.

Actual refrigerators, like actual heat engines, always contain irreversibilities 
that generate entropy inside the device. In a refrigerator this excess entropy is 
ejected at the higher temperature, as in the energy-entropy flow diagram of 
Figure 8.4. With the convention that all energy and entropy flows are positive, 
we now have 


$\sigma_h \geq \sigma_l$ (14)

in place of (8). Further,

$Q_h \geq (\tau_h/\tau_l) Q_l$, (15)

and

$W = Q_h - Q_l \geq [(\tau_h/\tau_l) - 1] Q_l = \frac{\tau_h - \tau_l}{\tau_l} Q_l = Q_l/\gamma_C$, (16)

so that

$\gamma \equiv Q_l/W \leq \gamma_C$. (17)

Figure 8.4 Entropy and energy flow in a refrigerator.


The Carnot coefficient $\gamma_C$ is an upper limit to the actual coefficient of refrigerator
performance $\gamma$, just as the Carnot efficiency $\eta_C$ is an upper limit to the actual
energy conversion efficiency $\eta$ of a heat engine.
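To see how the Carnot coefficient of Eq. (13) can fall on either side of unity, the following sketch evaluates it for two cases. The function name and the chosen temperatures (a freezer near 255 K and a liquid helium stage near 4.2 K, both rejecting heat at 300 K) are illustrative assumptions, not values taken from the text.

```python
def carnot_refrigerator_coefficient(t_low: float, t_high: float) -> float:
    """Carnot coefficient of refrigerator performance, Eq. (13).

    gamma_C = T_l / (T_h - T_l): heat extracted at T_l per unit of work
    consumed, in the reversible limit.
    """
    if not 0 < t_low < t_high:
        raise ValueError("need 0 < t_low < t_high")
    return t_low / (t_high - t_low)


# Assumed freezer temperatures: gamma_C is well above 1.
gamma_freezer = carnot_refrigerator_coefficient(255.0, 300.0)

# Assumed liquid helium stage: gamma_C is far below 1.
gamma_helium = carnot_refrigerator_coefficient(4.2, 300.0)

print(gamma_freezer, gamma_helium)
```

The second case shows why refrigeration at liquid helium temperatures is costly even in the reversible limit: each unit of heat removed requires many units of work.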

Both heat engines and refrigerators are subject to restrictions imposed by the 
law of increase of entropy, but the device design problems are totally different. 
In particular, the design of refrigerators to operate at the temperature of liquid 
helium or below is a challenging problem in thermal physics (Chapter 12). 


Air Conditioners and Heat Pumps 


Air conditioners are refrigerators that cool the inside of a building or an auto- 
mobile; the heat is ejected to the outside environment. If we interchange the 
inside and outside connections, an air conditioner can be used to heat a building 
during the winter. Such a device is called a heat pump. If $\tau_h - \tau_l \ll \tau_h$, a heat
pump can heat the building with a lower consumption of energy than by direct
heating (Problem 1).

The limitations on the use of heat pumps are largely economic. They are
much more costly to install and to maintain than are simple heaters or furnaces. 
Heat pumps make economic sense primarily in climatic conditions in which 
air conditioning is required anyway. 
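The energy saving of a heat pump can be estimated in the reversible limit. Applying Eqs. (5) and (6) to the reversed flows gives $W/Q_h = (\tau_h - \tau_l)/\tau_h$ for the work consumed per unit of heat delivered at the higher temperature. The sketch below evaluates this ratio; the function name and the indoor and outdoor temperatures are assumptions chosen for illustration.

```python
def heat_pump_work_per_unit_heat(t_outside: float, t_inside: float) -> float:
    """Work consumed per unit of heat delivered inside, reversible limit.

    W / Q_h = (T_h - T_l) / T_h, from Eqs. (5) and (6) with reversed flows.
    Direct (resistive) heating corresponds to a ratio of 1.
    """
    if not 0 < t_outside < t_inside:
        raise ValueError("need 0 < t_outside < t_inside")
    return (t_inside - t_outside) / t_inside


# Assumed winter conditions: 273 K outside, 293 K inside.
ratio = heat_pump_work_per_unit_heat(273.0, 293.0)
print(ratio)  # well below 1
```

A real heat pump consumes more work than this because of the irreversibilities discussed above, so the figure is a lower bound rather than an expected operating value.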


Carnot Cycle 


The derivation of the Carnot energy conversion efficiency and of the Carnot 
coefficient of refrigerator performance made no statement about how to 
realize a process by which work is generated from heat, or about how refrigera- 
tion is achieved. The simplest and best known such process is the Carnot cycle. 
In the Carnot cycle a gas—or another working substance—is expanded and 
compressed in four stages, two isothermal and two isentropic, as in Figure 8.5. 
At point 1 the gas has the temperature $\tau_h$ and the entropy $\sigma_L$. The gas is expanded
at constant $\tau$ until the entropy has increased to the value $\sigma_H$, at point 2. In the
second stage the gas is further expanded, now at constant $\sigma$, until the temperature
has dropped to the value $\tau_l$, at point 3. The gas is compressed isothermally to
point 4 and then compressed isentropically to the original state 1. We write $\sigma_L$
and $\sigma_H$ for the low and high values of the entropy contained in the working
substance, to distinguish these values from $\sigma_l$ and $\sigma_h$, which are the entropy
flows per cycle at the low and high temperatures $\tau_l$ and $\tau_h$. For the Carnot cycle,
$\sigma_l = \sigma_h = \sigma_H - \sigma_L$.

The work done by the system in one cycle is the area of the rectangle in 
Figure 8.5: 


$W = (\tau_h - \tau_l)(\sigma_H - \sigma_L)$, (18)

which follows from

$\oint dU = 0 = \oint \tau\, d\sigma - \oint p\, dV$,

where $\oint p\, dV$ is the work done by the system in one cycle. The heat taken up at
$\tau = \tau_h$ during the first phase is

$Q_h = \tau_h(\sigma_H - \sigma_L)$. (19)

We combine (18) and (19) to obtain the Carnot efficiency $\eta_C$. Any process
described by Figure 8.5 is called a Carnot cycle, regardless of the working
substance.


Figure 8.5 A Carnot cycle, for the conversion of heat into work,
illustrated as a plot of entropy versus temperature, for an arbitrary
working substance. The cycle consists of two expansion phases
(1 → 2 and 2 → 3) and two compression phases (3 → 4 and 4 → 1).
One of the expansion and one of the compression phases are
isothermal (1 → 2 and 3 → 4), and one phase of each kind is
isentropic (2 → 3 and 4 → 1). The net work done is the area of the
loop. The heat consumed at $\tau_h$ is the area surrounded by the
broken line.


The Carnot cycle is a point of reference to indicate what could in principle 
be done, rather than what in fact is done. All energy conversion cycles need a 
high temperature input and a low temperature output of heat, but often the 
heat inputs and outputs are not well-defined reservoirs at constant temperatures. 
Even where such reservoirs exist, as in steam turbines, there is invariably a 
temperature difference present between the working substance and the reser- 
voirs. The heating and cooling processes are never truly reversible. 


Example: Carnot cycle for an ideal gas. We carry an ideal monatomic gas through a 
Carnot cycle. Initially the gas occupies a volume $V_1$ and is in thermal equilibrium with a
reservoir $\mathcal{R}_h$ at the high temperature $\tau_h$. The gas is expanded isothermally to the volume
$V_2$, as in Figure 8.6a. In the process the gas absorbs the heat $Q_h$ from $\mathcal{R}_h$, and delivers it as
work $W_{12}$ to an external mechanical system connected to the piston. For an ideal gas the
heat absorbed from the reservoir is

$Q_h = W_{12} = \int p\, dV = N\tau_h \int dV/V = N\tau_h \log(V_2/V_1)$. (20)


This work is indicated by the area labeled “12.” Next, the gas is disconnected from $\mathcal{R}_h$, and
further expanded, now isentropically, until the temperature has dropped to the low
temperature $\tau_l$. In the process the additional work

$W_{23} = U(\tau_h) - U(\tau_l) = \frac{3}{2}N(\tau_h - \tau_l)$ (21)

is delivered by the gas. The volume $V_3$ at the end of the isentropic expansion is related to
$V_2$ by

$\tau_l V_3^{2/3} = \tau_h V_2^{2/3}$, or $V_3/V_2 = (\tau_h/\tau_l)^{3/2}$, (22)


from (6.63). After point 3 the gas is brought into contact with a temperature reservoir $\mathcal{R}_l$
at the temperature $\tau_l$, and then compressed isothermally (Figure 8.6b) to the volume $V_4$
chosen to satisfy

$V_4/V_1 = (\tau_h/\tau_l)^{3/2} = V_3/V_2$, (23)

so that $V_3/V_4 = V_2/V_1$. To accomplish this compression, the work

$W_{34} = N\tau_l \log(V_3/V_4) = N\tau_l \log(V_2/V_1)$ (24)

must be done on the gas. This work is ejected to $\mathcal{R}_l$ as heat:

$Q_l = W_{34}$. (25)

Finally, the gas is disconnected from $\mathcal{R}_l$ and recompressed isentropically until its temperature
has risen to the initial temperature $\tau_h$. Because of the choice (23) of $V_4$, the gas volume
at this point has returned to its initial value $V_1$, and the cycle is completed. In this last stage
the work

$W_{41} = \frac{3}{2}N(\tau_h - \tau_l)$ (26)

is performed on the gas: this cancels the work $W_{23}$ done by the gas during the isentropic
expansion 2 → 3, by (21).

The net work delivered by the gas during the cycle is given by the difference in shaded
areas in Figures 8.6a and 8.6b, which is the enclosed area in Figure 8.6c. The isentropic
curves in the p-V diagram are steeper than the isothermal curves, so that the area of the
loop is finite and is equal to the area of the rectangle in Figure 8.5. We have

$W = W_{12} + W_{23} - W_{34} - W_{41} = W_{12} - W_{34} = N(\tau_h - \tau_l)\log(V_2/V_1)$. (27)

The heat absorbed from $\mathcal{R}_h$ was given in (20), so that $W/Q_h = (\tau_h - \tau_l)/\tau_h$, which is just
the Carnot relation (7).

Figure 8.6 The Carnot cycle for an ideal gas, as a p-V plot. An ideal gas is expanded
and recompressed in four stages. Two of them are isothermal, at the temperatures $\tau_h$ and
$\tau_l$ ($\tau_h > \tau_l$). Two of them are isentropic, from $\tau_h$ to $\tau_l$, and back. The shaded areas show
(a) the work done during the two expansion stages, (b) the work done during the two
compression stages, and (c) the net work done during the cycle.
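The four stages of this example can be checked numerically. The sketch below follows Eqs. (20) through (27) for a monatomic ideal gas, with temperatures in energy units as in the text; the function name, the returned dictionary keys, and the numerical inputs are assumptions made for illustration.

```python
import math


def carnot_cycle_ideal_gas(n, tau_h, tau_l, v1, v2):
    """Work and heat terms of the ideal gas Carnot cycle, Eqs. (20)-(27).

    n is the number of particles; tau_h and tau_l are the reservoir
    temperatures in energy units; v1 and v2 bound the first isothermal stage.
    """
    ratio = (tau_h / tau_l) ** 1.5         # V3/V2 = V4/V1, Eqs. (22), (23)
    v3 = v2 * ratio
    v4 = v1 * ratio
    w12 = n * tau_h * math.log(v2 / v1)    # done by the gas, Eq. (20)
    w23 = 1.5 * n * (tau_h - tau_l)        # done by the gas, Eq. (21)
    w34 = n * tau_l * math.log(v3 / v4)    # done on the gas, Eq. (24)
    w41 = 1.5 * n * (tau_h - tau_l)        # done on the gas, Eq. (26)
    net = w12 + w23 - w34 - w41            # Eq. (27)
    return {"Q_h": w12, "Q_l": w34, "W": net, "eta": net / w12}


result = carnot_cycle_ideal_gas(n=1.0, tau_h=2.0, tau_l=1.0, v1=1.0, v2=3.0)
print(result["eta"])  # close to (tau_h - tau_l) / tau_h
```

The two isentropic work terms cancel, so the efficiency is fixed by the two isothermal stages alone, independent of the volume ratio chosen.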
Energy Conversion and the Second Law of Thermodynamics


The Carnot limits on the conversion of heat into work and on the performance 
of refrigerators are direct consequences of the law of increase of entropy. The 
second law of thermodynamics usually is formulated without mention of 
entropy. We stated the classical Kelvin-Planck formulation in Chapter 2: 
“It is impossible for any cyclic process to occur whose sole effect is the extraction 
of heat from a reservoir and the performance of an equivalent amount of work.” 

All reversible energy conversion devices that operate between the same temperatures
have the same energy conversion efficiency $\eta = W/Q_h$. Were this not
so, we could combine two reversible devices with different efficiencies, $\eta_1 < \eta_2$,
in such a way (Figure 8.7) that device 1 with the lower efficiency is operated in
reverse as a refrigerator that moves not only the entire waste heat $Q_{l2}$ from the
more efficient device 2 back to the higher temperature $\tau_h$, but an additional
amount $Q$(in) of heat as well. The overall result would be the conversion of
the heat $Q$(in) to work $W$(out), without any net waste heat. This would require
the annihilation of entropy and would violate the law of increase of entropy.

Now that we have established that all reversible devices that operate between 
the same temperatures have the same energy conversion efficiency, it is sufficient 
to calculate this efficiency for any particular device to find the common value. 
The Carnot cycle device leads to $\eta_C = (\tau_h - \tau_l)/\tau_h$ for the common value.
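The bookkeeping behind this argument can be made explicit with a short calculation. The sketch below assumes, as one reading of the arrangement in Figure 8.7, that both devices exchange the same heat $Q_h$ with the high temperature reservoir, so that the reservoir sees no net flow. The efficiencies are hypothetical numbers; the point of the argument is precisely that such a pair cannot exist.

```python
def combine_reversible_devices(eta_1, eta_2, q_h=1.0):
    """Energy balance for two hypothetical reversible devices, eta_1 < eta_2.

    Device 2 runs as an engine; device 1 runs in reverse as a refrigerator.
    Both are assumed to exchange the same heat q_h at the high temperature.
    Returns (q_in, w_out): net heat drawn at the low temperature and net work.
    """
    w_2 = eta_2 * q_h              # work delivered by the engine
    q_l2 = q_h - w_2               # waste heat of the engine
    w_1 = eta_1 * q_h              # work consumed by the reversed device
    q_l1 = q_h - w_1               # heat it lifts from the low temperature
    q_in = q_l1 - q_l2             # extra heat moved beyond the waste heat
    w_out = w_2 - w_1              # net work left over
    return q_in, w_out


q_in, w_out = combine_reversible_devices(eta_1=0.3, eta_2=0.5)
print(q_in, w_out)  # equal: heat from one reservoir fully turned into work
```

Because the net work equals the net heat drawn from a single reservoir, with no waste heat, the combination would contradict the Kelvin-Planck statement; the resolution is that $\eta_1 = \eta_2$.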


Path Dependence of Heat and Work 


We have carefully used the words heat and work to characterize energy transfer 
processes, and not to characterize properties of the system itself. It is not 
meaningful to speak of the heat content or of the work content of a system. 
We look at the Carnot cycle once more: Around a closed loop in the p-V 
plane, a net amount of work is generated by the system, and a net amount of 
heat is consumed. But the system—on being taken once around the loop—is 
returned to precisely the initial condition; no property of the system has 
changed. This means that there cannot exist two functions $Q(\sigma,V)$ and $W(\sigma,V)$
such that the heat $Q_{ab}$ and the work $W_{ab}$ required to carry the system from a
state $(\sigma_a,V_a)$ to a state $(\sigma_b,V_b)$ are given by the differences in $Q$ and $W$:

$Q_{ab} = Q(\sigma_b,V_b) - Q(\sigma_a,V_a)$; $W_{ab} = W(\sigma_b,V_b) - W(\sigma_a,V_a)$.

If such functions existed, the net transfers of heat and of work around a closed
loop necessarily would be zero, and we have shown that the transfers are not
zero.

Figure 8.7 If two different reversible energy conversion devices operating between the
same temperatures $\tau_h$ and $\tau_l$ could have different energy conversion efficiencies ($\eta_2 > \eta_1$),
it would be possible to combine them into a single device with 100 percent efficiency by
using the less efficient device 1 as a refrigerator that moves not only the entire waste
heat $Q_{l2}$ of the more efficient device 2 back to the higher temperature, but an additional
amount $Q$(in) of heat as well. This additional heat would then be completely converted
to work.

The transfers of heat and work between state (a) and state (b) depend on the 
path taken between the two states. This path-dependence is expressed when 
we say that heat and work are not state functions. Unlike temperature, entropy, 
and free energy, heat and work are not intrinsic attributes of the system. The 
increments $đQ$ and $đW$ that we introduced in (1) and (2) cannot be differentials
of mathematical functions $Q(\sigma,V)$ and $W(\sigma,V)$. For this reason we designated
the increments by $đQ$ and $đW$, rather than by $dQ$ and $dW$. Without the path
dependence of heat and work there would not exist cyclical processes that
permit the generation of work from heat.

Figure 8.8 Two irreversible processes in which mechanical or electrical potential
energy is used to heat a system.
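Path dependence is easy to exhibit for an ideal gas. The sketch below compares the work done by the gas along two reversible paths joining the same pair of states $(V_1, \tau)$ and $(V_2, \tau)$: an isothermal expansion, and an expansion at constant pressure followed by cooling at constant volume. The choice of paths, the function names, and the numbers are assumptions for illustration; the ideal gas law $pV = N\tau$ is used throughout.

```python
import math


def work_isothermal(n, tau, v1, v2):
    """Work done by the gas in a reversible isothermal expansion."""
    return n * tau * math.log(v2 / v1)


def work_isobaric_then_isochoric(n, tau, v1, v2):
    """Work done by the gas when it first expands at the initial pressure
    and is then cooled at constant volume back to temperature tau.
    Only the constant pressure stage contributes work."""
    p1 = n * tau / v1
    return p1 * (v2 - v1)


w_a = work_isothermal(1.0, 1.0, 1.0, 2.0)
w_b = work_isobaric_then_isochoric(1.0, 1.0, 1.0, 2.0)
print(w_a, w_b)  # different, although the end states are the same
```

Since the energy of an ideal gas depends only on temperature, the net heat absorbed along each path equals the work done along it, so the heat is path dependent as well.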


Irreversible Work 


We consider the energy transfer processes of Figure 8.8. In each process $\mathcal{S}_1$
is a purely mechanical or electrical system that delivers pure work with zero
entropy change. The energy transferred to $\mathcal{S}_2$ is converted to heat, either by
mechanical friction or by electrical resistance. The final state of $\mathcal{S}_2$ is the
same as if the energy had been added as heat in the first place. The entropy
of $\mathcal{S}_2$ is increased by $d\sigma_2 = dU_2/\tau$. This entropy is newly created entropy.
Processes in which new entropy is created are irreversible because there is 
no way to reverse the process in order to destroy the newly created entropy. 
If newly created entropy arises by the conversion of work to heat, we say that 
irreversible work has been performed. 

If we look only at the net change in a system, there is no way to tell whether 
the process that led to this change was reversible or irreversible. For a change 
$dU$ in energy and $d\sigma$ in entropy, we can define a reversible heat $đQ_{\rm rev}$ and a
reversible work $đW_{\rm rev}$ as the amount of heat and work that would accomplish
this change in a reversible process. If part of the work done on the system is
irreversible, the actual work required to accomplish a given change is larger
than the reversible work,

$đW_{\rm irrev} > đW_{\rm rev}$. (28)

By conservation of energy

$dU = đW_{\rm irrev} + đQ_{\rm irrev} = đW_{\rm rev} + đQ_{\rm rev}$,

so that

$đQ_{\rm irrev} < đQ_{\rm rev}$. (29)


The actual heat transferred in the irreversible process must be less than the 
reversible heat. 
Example: Sudden expansion of an ideal gas. As an example of an irreversible process we
consider once more the sudden expansion of an ideal gas into a vacuum. Neither heat nor
work is transferred, so that $dU = 0$ and $d\tau = 0$. The final state is identical with the state
that results from a reversible isothermal expansion with the gas in thermal equilibrium
with a reservoir. The work $W_{\rm rev}$ done on the gas in the reversible expansion from volume
$V_1$ to $V_2$ is, from (6.57),

$W_{\rm rev} = -N\tau \log(V_2/V_1)$. (30)

The work done on the gas is negative; the gas does positive work on the piston in an amount
equal to the heat transfer into the system:

$Q_{\rm rev} = -W_{\rm rev} > 0$; $W_{\rm rev} < 0$. (31)

The entropy change is equal to $Q_{\rm rev}/\tau$, or

$\sigma_2 - \sigma_1 = -W_{\rm rev}/\tau = N \log(V_2/V_1)$. (32)

In the irreversible process of expansion into the vacuum this entropy is newly created
entropy because neither heat nor work flows into the system from the outside:
$W_{\rm irrev} = Q_{\rm irrev} = 0$. From (31) we obtain

$W_{\rm irrev} > W_{\rm rev}$; $Q_{\rm irrev} < Q_{\rm rev}$, (33)

in agreement with (28) and (29).
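A numerical version of this comparison may help fix the signs. The sketch below evaluates Eqs. (30) through (32) for the reversible isothermal expansion and compares them with the zero heat and work of the sudden expansion; the function name and the doubling of the volume are assumptions for illustration.

```python
import math


def reversible_isothermal_expansion(n, tau, v1, v2):
    """Return (w_rev, q_rev, d_sigma) for a reversible isothermal expansion.

    w_rev is the work done ON the gas, Eq. (30); q_rev the heat INTO the gas,
    Eq. (31); d_sigma the entropy change, Eq. (32).
    """
    w_rev = -n * tau * math.log(v2 / v1)
    q_rev = -w_rev
    d_sigma = q_rev / tau
    return w_rev, q_rev, d_sigma


# Sudden expansion into a vacuum: no heat and no work cross the boundary.
w_irrev, q_irrev = 0.0, 0.0

w_rev, q_rev, d_sigma = reversible_isothermal_expansion(1.0, 1.0, 1.0, 2.0)
print(w_irrev > w_rev, q_irrev < q_rev, d_sigma)  # inequalities (33) hold
```

The entropy change is the same for both processes because the end states coincide; in the sudden expansion all of it is newly created.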
Figure 8.9 Systems between which only work but no
heat is transferred need not be at the same temperature
for the process to be reversible. (Diagram labels: piston areas $A_1$ and $A_2$,
with $p_1 A_1 = p_2 A_2$.)


In our discussion of irreversible work we assumed that the new entropy 
is created inside the system during the delivery of work to the system by other 
systems. This is not the only source of irreversibility in energy transfer. Pure 
heat transfer, not involving any work, is irreversible if it takes place between 
two systems having different temperatures. We worked out an example in 
Chapter 2. In this process heat is transferred from a system at $\tau_1$ to a system
at the lower temperature $\tau_2$. We have

$dU_1 = đQ_1 = \tau_1 d\sigma_1$; $dU_2 = đQ_2 = \tau_2 d\sigma_2$; $đQ_1 + đQ_2 = 0$. (34)

The newly created entropy is

$d\sigma_{\rm irr} = (d\sigma_1 + d\sigma_2) = đQ_1/\tau_1 + đQ_2/\tau_2 = (1/\tau_1 - 1/\tau_2)\, đQ_1 = \frac{\tau_2 - \tau_1}{\tau_1\tau_2}\, đQ_1$. (35)

The heat flow is from high to low temperature: $đQ_1$ is negative; $\tau_2 - \tau_1$ is
negative, so that $d\sigma_{\rm irr} > 0$.
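The sign argument can be checked with numbers. The sketch below evaluates Eq. (35) for a small heat transfer; the function name and the values of the temperatures and of the heat are assumptions for illustration, with temperatures in energy units.

```python
def entropy_created(dq_1, tau_1, tau_2):
    """Entropy created when heat flows between two systems, Eq. (35).

    dq_1 is the heat added to system 1 (negative when system 1 loses heat);
    system 2 receives -dq_1, by Eq. (34).
    """
    return (1.0 / tau_1 - 1.0 / tau_2) * dq_1


# System 1 is hotter (tau_1 = 2) and loses one unit of heat to system 2.
d_sigma_irr = entropy_created(dq_1=-1.0, tau_1=2.0, tau_2=1.0)
print(d_sigma_irr)  # 0.5, positive as required

# Equal temperatures: the transfer creates no entropy.
print(entropy_created(dq_1=-1.0, tau_1=1.5, tau_2=1.5))  # zero entropy created
```

The created entropy vanishes as the two temperatures approach each other, which is the reversible limit of heat transfer.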

The energy transfer between two systems with different temperatures need 
not be irreversible if only work but no heat is transferred (Figure 8.9). 

All actual energy transfer processes are invariably somewhat irreversible, 
but reversible processes remain the backbone of the theory of thermal physics. 
They constitute a natural limit, which is the equilibrium limit of vanishing 
entropy generation. We shall assume hereafter that the words heat and work, 
without a further qualifier, refer to reversible processes. 



HEAT AND WORK AT CONSTANT TEMPERATURE 
OR CONSTANT PRESSURE 


Isothermal work. We show that the total work performed on a system in
a reversible isothermal process is equal to the increase in the Helmholtz free
energy $F \equiv U - \tau\sigma$ of the system. For a reversible process $đQ = \tau\, d\sigma = d(\tau\sigma)$,
because $d\tau = 0$, so that

$đW = dU - đQ = dU - d(\tau\sigma) = dF$. (36)


Thus in such processes the Helmholtz free energy is the natural energetic 
function, more appropriate than the energy U. When we treat an isothermal 
process in terms of the Helmholtz free energy, we automatically include the 
additional work that is required to make up for the heat transfer from the 
system to the reservoir. Often the heat transfer is the major part of the work: 
for the ideal gas the energy U does not change in an isothermal process, and 
the work done is equal to the heat transfer. 
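For the ideal gas the statement (36) can be verified directly from results already obtained. The sketch below takes the work done on the gas from Eq. (30) and the entropy change from Eq. (32), and uses the fact that the energy of an ideal gas does not change in an isothermal process. The function name and the numerical values are assumptions for illustration.

```python
import math


def isothermal_ideal_gas(n, tau, v1, v2):
    """Return (work_on_gas, delta_f) for a reversible isothermal change.

    work_on_gas follows Eq. (30); delta_f = delta_u - tau * delta_sigma,
    with delta_u = 0 for an ideal gas and delta_sigma from Eq. (32).
    """
    work_on_gas = -n * tau * math.log(v2 / v1)
    delta_u = 0.0
    delta_sigma = n * math.log(v2 / v1)
    delta_f = delta_u - tau * delta_sigma
    return work_on_gas, delta_f


# Compression from volume 2 to volume 1 at temperature tau = 1.
work, delta_f = isothermal_ideal_gas(1.0, 1.0, 2.0, 1.0)
print(work, delta_f)  # equal and positive
```

Here the whole of the work goes into the heat delivered to the reservoir, and the free energy rises only through the decrease of the entropy.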


Isobaric heat and work. Many energy transfer processes, isothermal or not,
take place at constant pressure, particularly those processes that take place 
in systems open to the atmosphere. A process at constant pressure is said to be 
an isobaric process. A simple example is the boiling of a liquid as in Figure 8.10, 


where the pressure on the piston is the external atmospheric pressure. If the
system changes its volume by $dV$, the work $-p\, dV = -d(pV)$ is part of the
total work done on the system. If positive, this work is provided by the environment
and is in this sense “free.” If negative, the work is delivered to the
environment and is not extractable from the system for other purposes. For
this reason it is often appropriate to subtract $-d(pV)$ from the total work. We
thus obtain the effective work performed on the system, defined as

$đW' \equiv đW + d(pV) = dU + d(pV) - đQ = dH - đQ$, (37)

where we have defined a new function

$H = U + pV$, (38)

called the enthalpy, which plays the role in processes at constant pressure that
the energy $U$ plays in processes at constant volume. The term $pV$ in (38) is the
work required to displace the surrounding atmosphere in order to vacate
the space to be occupied by the system. Implicit in these definitions is the idea
that there are other kinds of work besides that due to volume changes.

Figure 8.10 When a liquid boils under atmospheric pressure, the vapor
displacing the atmosphere does work against the atmospheric pressure.
(Diagram label: force on the piston $F = p_{\rm atm}A$.)

Two classes of the constant pressure processes are particularly important: 

(a) Processes in which no effective work is done. The heat transfer is $đQ = dH$,
from (37). The evaporation of a liquid (Chapter 10) from an open vessel is 
such a process, because no effective work is done. The heat of vaporization 
is the enthalpy difference between the vapor phase and the liquid phase. 

(b) Processes at constant temperature and constant pressure. Then $đQ = \tau\, d\sigma = d(\tau\sigma)$,
and the effective work performed on the system is, from (36) and
(37),

$đW' = dF + d(pV) = dG$, (39)

where we have defined another new function

$G \equiv F + pV = U + pV - \tau\sigma$, (40)


the Gibbs free energy. The effective work performed in a reversible process 
at constant temperature and pressure is equal to the change in the Gibbs 
free energy of the system. This is particularly useful in chemical reactions 
where the volume changes as the reaction proceeds at a constant pressure. 
The Gibbs free energy is used extensively in Chapter 9, and the enthalpy is 
used in Chapter 10. 



Example: Electrolysis and fuel cells. Electrolysis is a process that is both isothermal and 
isobaric. Consider an electrolyte of dilute sulfuric acid in which are immersed platinum 
electrodes that do not react with the acid (Figure 8.11). The sulfuric acid dissociates into 
$\mathrm{H^+}$ and $\mathrm{SO_4^{--}}$ ions:

$\mathrm{H_2SO_4 \rightleftharpoons 2H^+ + SO_4^{--}}$. (41)

When a current is passed through the cell the hydrogen ions move to the negative electrode
where they take up electrons and form molecular hydrogen gas:

$\mathrm{2H^+ + 2e^- \to H_2}$. (42)

The sulfate ions move to the positive electrode where they decompose water with the
release of molecular oxygen gas and electrons:

$\mathrm{SO_4^{--} + H_2O \to H_2SO_4 + \tfrac{1}{2}O_2 + 2e^-}$. (43)

The sum of the above three steps is the net reaction equation in the cell:

$\mathrm{H_2O \to H_2 + \tfrac{1}{2}O_2}$. (44)


When carried out slowly in a vessel open to the atmosphere, the process is at constant
pressure and constant temperature. A negligible part of the electrical input power goes into
resistance heating of the electrolyte. The effective work required to decompose 1 mole of
water is related to the molar Gibbs free energies of the reactants:

$W' = \Delta G = G(\mathrm{H_2O}) - G(\mathrm{H_2}) - \tfrac{1}{2}G(\mathrm{O_2})$. (45)

Chemical tables list the Gibbs free energy difference $\Delta G$ as $-237$ kJ per mole at room
temperature.

Figure 8.11 An electrolysis cell. An electrical current passes through an electrolyte, such
as dilute sulfuric acid. The overall result is the decomposition of water into gaseous
hydrogen and oxygen. The process is an example of work being done at constant
temperature and constant pressure.

In electrolysis this work is performed by a current $I$ that flows under an external voltage
$V_0$. If $t$ is the time required to decompose one mole of water, $Q = I \times t$ is the total charge
(not the heat!) flowing through the cell, and we have

$W' = QV_0$. (46)

According to (43), there are two electrons involved in decomposing one water molecule,
hence

$Q = -2N_A e = -1.93 \times 10^5$ coulomb. (47)

We equate (46) to (45) to obtain the condition for electrolysis to take place. This requires a
minimum voltage

$V_0 = -\Delta G/2N_A e$, (48)

or 1.229 volts. A voltage larger than $V_0$ must be applied to obtain a finite current flow,
because $V_0$ alone merely reduces to zero the potential barrier between the systems on the
two sides of the reaction equation (44). When $V > V_0$, the excess power $(V - V_0) \times I$ will
be dissipated as heat in the electrolyte.
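The numbers in (47) and (48) can be reproduced as follows. The sketch uses the rounded value $\Delta G = -237$ kJ per mole quoted above together with the SI values of the Avogadro constant and the elementary charge; the function and constant names are assumptions for illustration. With the rounded $\Delta G$ the result agrees with 1.229 volts only to about three figures.

```python
AVOGADRO = 6.02214076e23             # particles per mole (SI defined value)
ELEMENTARY_CHARGE = 1.602176634e-19  # coulomb (SI defined value)


def charge_per_mole(electrons_per_molecule=2):
    """Total charge Q of Eq. (47), negative by the sign convention used."""
    return -electrons_per_molecule * AVOGADRO * ELEMENTARY_CHARGE


def minimum_electrolysis_voltage(delta_g_joule_per_mole):
    """V_0 = -Delta G / (2 N_A e), Eq. (48), from W' = Q V_0 = Delta G."""
    return delta_g_joule_per_mole / charge_per_mole()


q_total = charge_per_mole()
v_0 = minimum_electrolysis_voltage(-237.0e3)
print(q_total, v_0)  # about -1.93e5 coulomb and about 1.23 volts
```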

If $V < V_0$, the reaction (44) will proceed from right to left provided gaseous hydrogen is
available at the positive electrode and gaseous oxygen at the negative electrode. In the
simple setup of Figure 8.11 the gases are permitted to escape, and for $V < V_0$ nothing will
happen at all. It is possible, however, to construct the electrodes as porous sponges, with
hydrogen and oxygen forced through under pressure (Figure 8.12). Such a device produces a
voltage $V_0$ between the electrodes and, if the electrodes are connected, external current will
flow. This arrangement is called a hydrogen-oxygen fuel cell. Fuel cells were used as power
sources on board the Gemini and Apollo* spacecraft and incidentally produced drinking
water for the astronauts.

The principal technological limitation of fuel cells is their low current per unit electrode 
area. In the Apollo cell the current density was only a few hundred mA/cm$^2$; hence large
electrode areas are required to produce reasonable currents. The current-voltage characteristic
of an electrochemical cell in its two operating ranges, as fuel cell and as electrolytic
cell, is shown in Figure 8.13.

* The Apollo fuel cells used Ni and NiO rather than Pt as electrodes, and KOH rather than $\mathrm{H_2SO_4}$
as electrolyte. For a detailed description, and more information on fuel cells, the reader is referred
to J. O. M. Bockris and S. Srinivasan, Fuel cells: Their electrochemistry, McGraw-Hill, New York,
1969.


Figure 8.12 A fuel cell is an electrolysis cell operated in
reverse, with hydrogen and oxygen supplied as fuels. The
fuels are forced under pressure through porous electrodes
separated by an electrolyte. The hydrogen and oxygen react
to form water; the excess Gibbs free energy is delivered
outside as electrical energy. Water forms at the positive
electrode and is removed there.

Figure 8.13 The current-voltage characteristic of an electrolytic
cell or fuel cell, indicating the two operating ranges. (Axes: current $I$
versus voltage $V$; regions labeled fuel cell and electrolysis.)
Chemical Work


Work performed by the transfer of particles to a system is called chemical 
work, because it is associated with the chemical potential. 

When particles are transferred, the number of particles in the system is one 
of the independent variables on which the energy $U$ depends. If $U = U(\sigma,V,N)$,
then for a reversible process

$dU = \tau\, d\sigma - p\, dV + \mu\, dN$, (49)

by the thermodynamic identity of Chapter 5. Here we have replaced the partial
derivatives by their familiar equivalents (Table 5.1). By our definition of heat,
the $\tau\, d\sigma$ term represents the transfer of heat and the $-p\, dV$ and $\mu\, dN$ terms
represent the performance of work, all understood to be reversible:

$đW = -p\, dV + \mu\, dN$. (50)

The $-p\, dV$ term is mechanical work; the $\mu\, dN$ term is the chemical work:

$đW_c = \mu\, dN$. (51)

If there is no volume change, $dV = 0$ and all the work is chemical.

In particle transfer there are usually two systems involved, both in contact 
with a heat reservoir, and the total chemical work is the sum of the contributions 
from both systems. In the arrangement of Figure 8.14 a pump transfers particles
from system $\mathcal{S}_1$ to system $\mathcal{S}_2$. The chemical potentials are $\mu_1$ and $\mu_2$. If
$dN = dN_2 = -dN_1$ is the number of particles transferred, the total chemical
work performed is

$đW_c = đW_{c1} + đW_{c2} = \mu_1 dN_1 + \mu_2 dN_2 = (\mu_2 - \mu_1)\, dN$. (52)

The work that must be supplied to the pump is $đW_c$ if there is no volume work
($dV_1 = dV_2 = 0$), and if all processes are reversible.

The result (52) gives an additional meaning of the chemical potential. We 
summarize the properties of the chemical potential: 


(a) The chemical potential of a system is the work required to transfer one 
particle into the system, from a reservoir at zero chemical potential. 

(b) The difference in chemical potential between two systems is equal to the 
net work required to move a particle from one system to the other. 


Figure 8.14 Chemical work is the work performed when particles are moved 
reversibly from one system to another, with the two systems having different 
chemical potentials. If the two volumes do not change, the work is pure 
chemical work; the amount per particle is the difference in chemical potentials. 


(c) If the two systems are in diffusive equilibrium they have the same chemical
potential; no work is required to move a particle from one system to 
the other. 

(d) The difference in internal chemical potential (Chapter 5) between two 
systems is equal but opposite to the potential barrier that maintains the 
systems in diffusive equilibrium. 
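
The bookkeeping behind result (52) and properties (b) and (c) can be illustrated with a few lines of Python. This is only a sketch: the function name and the sample chemical potentials are hypothetical, and the energy units are arbitrary.

```python
def total_chemical_work(mu_1, mu_2, d_n):
    """Work to move d_n particles reversibly from system 1 to system 2.

    System 1 loses d_n particles (dN_1 = -d_n) and system 2 gains them
    (dN_2 = +d_n), so mu_1*dN_1 + mu_2*dN_2 equals (mu_2 - mu_1)*d_n.
    """
    d_n1 = -d_n
    d_n2 = +d_n
    return mu_1 * d_n1 + mu_2 * d_n2


# Hypothetical chemical potentials, in arbitrary energy units.
work_uphill = total_chemical_work(mu_1=-0.30, mu_2=-0.10, d_n=1.0)
work_equilibrium = total_chemical_work(mu_1=-0.20, mu_2=-0.20, d_n=1.0)
print(work_uphill, work_equilibrium)
```

The second call uses equal chemical potentials, the case of diffusive equilibrium, for which no work is required.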
Example: Chemical work for an ideal gas. We consider the work per particle required to
move reversibly the atoms of a monatomic ideal gas from $\mathcal{S}_1$ with concentration $n_1$, to $\mathcal{S}_2$
with concentration $n_2 > n_1$, both systems being at the same temperature (Figure 8.15). If
$dV = 0$, the work contains only a chemical work term, which can be calculated from the
difference in chemical potential, no matter how the process is actually performed. The
chemical potential difference between two ideal gas systems with different concentrations is

$$\mu_2 - \mu_1 = \tau[\log(n_2/n_Q) - \log(n_1/n_Q)] = \tau\log(n_2/n_1) . \tag{53}$$

This result is equal to the mechanical work per particle required to compress the gas
isothermally from the concentration $n_1$ to the concentration $n_2$. The work required to
compress $N$ particles of an ideal gas from an initial volume $V_1$ to a final volume $V_2$ is

$$W = -\int p\,dV = -N\tau\int dV/V = N\tau\log(V_1/V_2) = N\tau\log(n_2/n_1) . \tag{54}$$

Hence the mechanical work per particle is $\tau\log(n_2/n_1)$, identical to the result (53). The
identity of the chemical work with the isothermal compression work illustrates the
equivalence or convertibility of different kinds of work.

Figure 8.15 Isothermal chemical work. The amount of chemical work per particle
does not change if the process is performed isothermally with both systems in
thermal equilibrium with a common large reservoir.
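
The agreement between (53) and (54) can be checked numerically. The following is a minimal sketch, not part of the original text: it works in units where the temperature is 1, and the function names, the sample concentrations, and the midpoint-rule integration are all illustrative assumptions.

```python
import math


def chemical_work_per_particle(tau, n1, n2):
    """Chemical potential difference mu_2 - mu_1 = tau*log(n2/n1), as in (53)."""
    return tau * math.log(n2 / n1)


def compression_work_per_particle(tau, n1, n2, steps=100_000):
    """Integrate -p dv per particle for an ideal gas, with p = tau/v.

    Here v = V/N is the volume per particle, running from 1/n1 to 1/n2.
    The integral is approximated with the midpoint rule.
    """
    v1, v2 = 1.0 / n1, 1.0 / n2
    dv = (v2 - v1) / steps
    work = 0.0
    for i in range(steps):
        v_mid = v1 + (i + 0.5) * dv
        work += -(tau / v_mid) * dv
    return work


tau = 1.0          # temperature in energy units (assumed value)
n1, n2 = 1.0, 3.0  # concentrations in arbitrary units (assumed values)
w_chem = chemical_work_per_particle(tau, n1, n2)
w_mech = compression_work_per_particle(tau, n1, n2)
print(w_chem, w_mech)
```

The two printed numbers should agree to within the accuracy of the numerical integration.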


Magnetic Work and Superconductors 


An important form of work is magnetic work. The most important application 
of magnetic work is to superconductors, and this application is treated here. 

Below some critical temperature $T_c$ that is usually less than 20 K, many
electrical conductors undergo a transition from their normal state with a finite
electrical conductivity to a superconducting state with an apparently infinite
conductivity.

Superconductors expel magnetic fields from their interior. If the superconductor
is first cooled below the critical temperature and then inserted into a
magnetic field, we might expect that the infinite conductivity would shield the
interior from penetration by the magnetic field. However, the expulsion occurs
even if the superconductor is cooled below $T_c$ while in a magnetic field (Figure
8.16). This active expulsion, called the Meissner effect, shows that superconductivity
is more than an infinite conductivity. The Meissner effect is caused by
shielding currents that are spontaneously generated near the surface, in a layer
about $10^{-5}$ cm thick. The magnetic field expulsion is not always complete.
Superconductors are said to be of type II if the expulsion is incomplete, but still
nonzero, in a range of fields above some low field. We shall restrict ourselves
here to the discussion of type I superconductors, for which the field expulsion
is complete up to a definite field and zero thereafter.

Figure 8.16 Meissner effect in a superconducting sphere cooled
in a constant applied magnetic field; on passing below the
transition temperature the lines of induction B are ejected from
the sphere.

Figure 8.17 Threshold curves of the critical
field versus temperature for several superconductors.
A specimen is superconducting
below the curve and normal above the curve.
Axes: critical field in gauss versus temperature in K.

A sufficiently strong magnetic field will destroy superconductivity. The 
critical field required to do this depends on the temperature and on the super- 
conductor. For type I superconductors the fields are usually a few hundred 
gauss (Figure 8.17). In some niobium and vanadium compounds of type II, 
critical fields of several hundred kilogauss have been observed. 

The Meissner magnetic effect shows that the normal and the superconducting
states are different thermodynamic phases of the same metal, just as ice and
liquid water are different phases of $\mathrm{H_2O}$, except that in the superconducting
transition it is the electronic system rather than the crystal structure of the metal
that undergoes a phase transition.

Figure 8.19 A superconductor of length L and area A in a superconducting
solenoid that produces a magnetic field $B_a$.

The superconducting state is a distinct thermodynamic phase, as confirmed 
by differences in the heat capacity of the normal and the superconducting states. 
The heat capacity (Figure 8.18) exhibits a pronounced discontinuity at the onset 
of superconductivity at $\tau = \tau_c$; when superconductivity is destroyed by a
magnetic field, the discontinuity disappears. The stable phase will be the phase
with the lower free energy. Below $\tau = \tau_c$ in zero magnetic field the free energy
of the superconducting phase is lower than that of the normal phase. The free 
energy of the superconducting phase increases in the magnetic field, as we show 
below. The free energy of the normal phase is approximately independent of 
the field. Eventually, as the field is increased, the free energy of the super- 
conducting phase will exceed that of the normal phase. The normal phase is 
then the stable phase, and superconductivity is destroyed. 

The increase of the free energy of a superconductor in a magnetic field is 
calculated as the work required to reduce the magnetic field to zero in the interior 
of the superconductor; the zero value is required to account for the Meissner 
effect. Consider a superconductor in the form of a long rod of uniform cross-section
inside a long solenoid that produces a uniform field $B_a$, as in Figure 8.19.
The work required to reduce the field to zero inside the superconductor is
equal to the work required to create within the superconductor a counteracting
field $-B_a$ that exactly cancels the solenoid field. We know from electromagnetic
theory that the work per unit volume required to create a field $B$ is given by

(SI)    $$W_{\rm mag}/V = B^2/2\mu_0 ; \tag{55a}$$

or

(CGS)   $$W_{\rm mag}/V = B^2/8\pi . \tag{55b}$$

This is the amount by which the free energy density in the bulk superconductor
is raised by application of an external magnetic field, in an experiment at constant
temperature.

Figure 8.20 The free energy density $F_N$ of a
nonmagnetic normal metal is approximately
independent of the intensity of the applied
magnetic field $B_a$. At a temperature $\tau < \tau_c$ the
metal is a superconductor in zero magnetic
field, so that $F_S(\tau,0)$ is lower than $F_N(\tau,0)$. An
applied magnetic field increases $F_S$ by $B_a^2/2\mu_0$,
in SI units (and by $B_a^2/8\pi$ in CGS units), so
that $F_S(\tau,B_a) = F_S(\tau,0) + B_a^2/2\mu_0$. If $B_a$ is
larger than the critical field $B_{ac}$ the free energy
density is lower in the normal state than in
the superconducting state, and now the normal
state is the stable state. The origin of the
vertical scale in the drawing is at $F_S(\tau,0)$. The
figure equally applies to $U_S$ and $U_N$ at $\tau = 0$.
Axes: free energy density versus applied magnetic field $B_a$,
with the normal state and the critical field $B_{ac}$ marked.

There is no comparable free energy increase for the normal conductor, 
because there is no screening of the applied field. Thus 


(SI)    $$[F_N(\tau) - F_S(\tau)]/V = B_c^2(\tau)/2\mu_0 ; \tag{56a}$$
or
(CGS)   $$[F_N(\tau) - F_S(\tau)]/V = B_c^2(\tau)/8\pi . \tag{56b}$$


In a plot of the free energy density of both phases versus the magnetic field 
(Figure 8.20), the free energy of the superconducting phase will ultimately rise 
above that of the normal phase, so that in high fields the specimen will be in the 
normal phase, and the superconducting phase is no longer the stable phase. 
This is the explanation of the destruction of superconductivity by a critical 
magnetic field $B_c$.

With increasing temperature the free energy difference between normal and
superconducting phase decreases as $\tau \to \tau_c$, and the critical magnetic field
decreases. Everything else being equal, a high stabilization energy in a type I 
superconductor will lead to both a high critical temperature and a high critical 
field. The highest critical fields are found amongst the superconductors with the 
highest critical temperatures, and vice versa. 
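
To get a feeling for the size of the stabilization energy density, the two forms of the field energy can be evaluated for a field of a few hundred gauss. The sketch below is illustrative only: the 500 gauss value is an assumed sample field, not a measured critical field of any particular metal, and the function names are hypothetical.

```python
import math

MU_0 = 4.0e-7 * math.pi  # vacuum permeability in SI units (T m / A)


def free_energy_density_si(b_tesla):
    """B^2 / (2 mu_0), in J/m^3, for B in tesla."""
    return b_tesla**2 / (2.0 * MU_0)


def free_energy_density_cgs(b_gauss):
    """B^2 / (8 pi), in erg/cm^3, for B in gauss."""
    return b_gauss**2 / (8.0 * math.pi)


b_gauss = 500.0            # assumed sample field, "a few hundred gauss"
b_tesla = b_gauss * 1.0e-4  # 1 gauss = 1e-4 tesla
f_si = free_energy_density_si(b_tesla)
f_cgs = free_energy_density_cgs(b_gauss)

# 1 erg/cm^3 = 0.1 J/m^3, so the two results describe the same energy density.
print(f_si, 0.1 * f_cgs)
```

For this sample field the energy density is of order $10^3$ J m$^{-3}$, which sets the scale of the free energy difference between the normal and superconducting phases.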


SUMMARY 


1. Heat is the transfer of energy by thermal contact with a reservoir. In a
reversible process $\bar{d}Q = \tau\,d\sigma$.


2. Work is the transfer of energy by a change in the external parameters that
describe the system. The entropy transfer in a reversible process is zero when
only work is performed and no heat is transferred.


3. The Carnot energy conversion efficiency, $\eta_C = (\tau_h - \tau_l)/\tau_h$, is the upper
limit to the ratio $W/Q_h$ of the work generated to the heat added.


4. The Carnot coefficient of refrigerator performance, $\gamma_C = \tau_l/(\tau_h - \tau_l)$, is the
upper limit to the ratio $Q_l/W$ of the heat extracted to the work consumed.


5. The total work performed on a system at constant temperature in a reversible
process is equal to the change in the Helmholtz free energy $F \equiv U - \tau\sigma$
of the system.


6. The effective work performed on a system at constant temperature and
pressure in a reversible process is equal to the change in the Gibbs free
energy $G \equiv U - \tau\sigma + pV$.


7. The chemical work performed on a system in the reversible transfer of $dN$
particles to the system is $\mu\,dN$.


8. The change in the free energy density of a superconductor (of type I) caused
by an external magnetic field $B$ is $B^2/2\mu_0$ in SI and $B^2/8\pi$ in CGS.
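
The two Carnot limits in items 3 and 4 are simple functions of the reservoir temperatures. A minimal sketch, with hypothetical function names and assumed sample temperatures, is:

```python
def carnot_efficiency(tau_h, tau_l):
    """Upper limit of W/Q_h for a heat engine between tau_h and tau_l."""
    return (tau_h - tau_l) / tau_h


def carnot_refrigerator_coefficient(tau_h, tau_l):
    """Upper limit of Q_l/W for a refrigerator between tau_h and tau_l."""
    return tau_l / (tau_h - tau_l)


# Only temperature ratios enter, so kelvin values may be used directly.
t_h, t_l = 600.0, 300.0  # assumed sample temperatures, in K
eta_c = carnot_efficiency(t_h, t_l)
gamma_c = carnot_refrigerator_coefficient(t_h, t_l)
print(eta_c, gamma_c)
```

The two quantities are related by $\gamma_C = 1/\eta_C - 1$, so a small temperature difference gives a low engine efficiency but a high refrigerator coefficient.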


PROBLEMS 


1. Heat pump. (a) Show that for a reversible heat pump the energy required
per unit of heat delivered inside the building is given by the Carnot efficiency (6):

$$\frac{W}{Q_h} = \eta_C = \frac{\tau_h - \tau_l}{\tau_h} .$$

What happens if the heat pump is not reversible? (b) Assume that the electricity
consumed by a reversible heat pump must itself be generated by a Carnot engine
operating between the temperatures $\tau_{hh}$ and $\tau_l$. What is the ratio $Q_{hh}/Q_h$ of the
heat consumed at $\tau_{hh}$ to the heat delivered at $\tau_h$? Give numerical values for
$T_{hh} = 600$ K; $T_h = 300$ K; $T_l = 270$ K. (c) Draw an energy-entropy flow
diagram for the combination heat engine-heat pump, similar to Figures 8.1,
8.2 and 8.4, but involving no external work at all, only energy and entropy flows
at three temperatures.
2. Absorption refrigerator. In absorption refrigerators the energy driving the
process is supplied not as work, but as heat from a gas flame at a temperature
$\tau_{hh} > \tau_h$. Mobile home and cabin refrigerators may be of this type, with propane
fuel. (a) Give an energy-entropy flow diagram similar to Figures 8.2 and 8.4 for
such a refrigerator, involving no work at all, but with energy and entropy flows
at the three temperatures $\tau_{hh} > \tau_h > \tau_l$. (b) Calculate the ratio $Q_l/Q_{hh}$, for the
heat extracted at $\tau = \tau_l$, where $Q_{hh}$ is the heat input at $\tau = \tau_{hh}$. Assume reversible
operation.


3. Photon Carnot engine. Consider a Carnot engine that uses as the working
substance a photon gas. (a) Given $\tau_h$ and $\tau_l$ as well as $V_1$ and $V_2$, determine
$V_3$ and $V_4$. (b) What is the heat $Q_h$ taken up and the work done by the gas during
the first isothermal expansion? Are they equal to each other, as for the ideal gas?
(c) Do the two isentropic stages cancel each other, as for the ideal gas? (d) Calculate
the total work done by the gas during one cycle. Compare it with the heat
taken up at $\tau_h$ and show that the energy conversion efficiency is the Carnot
efficiency.


4. Heat engine-refrigerator cascade. The efficiency of a heat engine is to be
improved by lowering the temperature of its low-temperature reservoir to a
value $\tau_r$, below the environmental temperature $\tau_l$, by means of a refrigerator. The
refrigerator consumes part of the work produced by the heat engine. Assume
that both the heat engine and the refrigerator operate reversibly. Calculate the
ratio of the net (available) work to the heat $Q_h$ supplied to the heat engine at
temperature $\tau_h$. Is it possible to obtain a higher net energy conversion efficiency
in this way?


5. Thermal pollution. A river with a water temperature $T_l = 20$°C is to be
used as the low temperature reservoir of a large power plant, with a steam
temperature of $T_h = 500$°C. If ecological considerations limit the amount of
heat that can be dumped into the river to 1500 MW, what is the largest electrical
output that the plant can deliver? If improvements in hot-steam technology
would permit raising $T_h$ by 100°C, what effect would this have on the plant
capacity?

6. Room air conditioner. A room air conditioner operates as a Carnot cycle
refrigerator between an outside temperature $T_h$ and a room at a lower temperature
$T_l$. The room gains heat from the outdoors at a rate $A(T_h - T_l)$; this heat
is removed by the air conditioner. The power supplied to the cooling unit is $P$.
(a) Show that the steady state temperature of the room is

$$T_l = (T_h + P/2A) - [(T_h + P/2A)^2 - T_h^2]^{1/2} .$$

(b) If the outdoors is at 37°C and the room is maintained at 17°C by a cooling
power of 2 kW, find the heat loss coefficient $A$ of the room in W K$^{-1}$. A good
discussion of room air conditioners is given by H. S. Leff and W. D. Teeters,
Amer. J. Physics 46, 19 (1978). In a realistic unit the cooling coils may be at
282 K and the outdoor heat exchanger at 378 K.


7. Light bulb in a refrigerator. A 100W light bulb is left burning inside a 
Carnot refrigerator that draws 100 W. Can the refrigerator cool below room 
temperature? 


8. Geothermal energy. A very large mass $M$ of porous hot rock is to be
utilized to generate electricity by injecting water and utilizing the resulting hot
steam to drive a turbine. As a result of heat extraction, the temperature of the
rock drops, according to $dQ_h = -MC\,dT_h$, where $C$ is the specific heat of the
rock, assumed to be temperature independent. If the plant operates at the Carnot
limit, calculate the total amount $W$ of electrical energy extractable from the
rock, if the temperature of the rock was initially $T_h = T_i$, and if the plant is to
be shut down when the temperature has dropped to $T_h = T_f$. Assume that the
lower reservoir temperature $T_l$ stays constant.

At the end of the calculation, give a numerical value, in kWh, for $M = 10^{14}$ kg
(about 30 km$^3$), $C = 1$ J g$^{-1}$ K$^{-1}$, $T_i = 600$°C, $T_f = 110$°C, $T_l = 20$°C.
Watch the units and explain all steps! For comparison: The total electricity 
produced in the world in 1976 was between 1 and 2 times 10'* kWh. 


9. Cooling of nonmetallic solid to $T = 0$. We saw in Chapter 4 that the heat
capacity of nonmetallic solids at sufficiently low temperatures is proportional
to $T^3$, as $C = aT^3$. Assume it were possible to cool a piece of such a solid to
$T = 0$ by means of a reversible refrigerator that uses the solid specimen as its
(varying!) low-temperature reservoir, and for which the high-temperature
reservoir has a fixed temperature $T_h$ equal to the initial temperature $T_i$ of the
solid. Find an expression for the electrical energy required.


10. Irreversible expansion of a Fermi gas. Consider a gas of $N$ noninteracting,
spin ½ fermions of mass $M$, initially in a volume $V_i$ at temperature $\tau_i = 0$. Let the
gas expand irreversibly into a vacuum, without doing work, to a final volume
$V_f$. What is the temperature of the gas after expansion if $V_f$ is sufficiently large
for the classical limit to apply? Estimate the factor by which the gas should be
expanded for its temperature to settle to a constant final value. Give numerical
values for the final temperature in kelvin for two cases: (a) a particle mass equal
to the electron mass, and $N/V = 10^{22}$ cm$^{-3}$, as in metals; (b) a particle mass
equal to a nucleon, and $N/V = 10^{30}$, as in white dwarf stars.
Chapter 9: Gibbs Free Energy and Chemical Reactions


GIBBS FREE ENERGY 


The Helmholtz free energy F introduced in Chapter 3 describes a system at 
constant volume and temperature. But many experiments, and in particular 
many chemical reactions, are performed at constant pressure, often one atmo- 
sphere. It is useful to introduce another function to treat the equilibrium 
configuration at constant pressure and temperature. As in Chapter 8, we 
define the Gibbs free energy G as 


$$G \equiv U - \tau\sigma + pV . \tag{1}$$


Chemists often call this the free energy, and physicists often call it the 
thermodynamic potential. 

The most important property of the Gibbs free energy is that it is a minimum
for a system $\mathcal{S}$ in equilibrium at constant pressure when in thermal contact
with a reservoir $\mathcal{R}$. The differential of $G$ is

$$dG = dU - \tau\,d\sigma - \sigma\,d\tau + p\,dV + V\,dp .$$

Consider a system (Figure 9.1) in thermal contact with a heat reservoir $\mathcal{R}_1$
at temperature $\tau$ and in mechanical contact with a pressure reservoir $\mathcal{R}_2$ that
maintains the pressure $p$, but cannot exchange heat. Now $d\tau = 0$ and $dp = 0$,
so that the differential $dG$ of the system in the equilibrium configuration becomes

$$dG_S = dU_S - \tau\,d\sigma_S + p\,dV_S . \tag{2}$$

The thermodynamic identity (5.39) is

$$\tau\,d\sigma_S = dU_S - \mu\,dN_S + p\,dV_S , \tag{3}$$

so that (2) becomes $dG_S = \mu\,dN_S$. But $dN_S = 0$, whence

$$dG_S = 0 , \tag{4}$$
Figure 9.1 A system in thermal equilibrium with a heat reservoir $\mathcal{R}_1$ and in
mechanical equilibrium with a barystat or pressure reservoir $\mathcal{R}_2$ which maintains
a constant pressure on the system. The barystat is thermally insulated. (Labels in
the drawing: heat reservoir; plunger to equalize pressure; pressure reservoir.)


which is the condition for $G_S$ to be an extremum with respect to system variations
at constant pressure, temperature, and particle number. These are, therefore,
the natural variables for $G(N,\tau,p)$.

That the extremum of $G_S$ must be a minimum, rather than a maximum,
follows directly from the minus sign associated with the entropy in (1): Any
irreversible change taking place entirely within $\mathcal{S}$ will increase $\sigma$ and thus
decrease $G_S$.

With the thermodynamic identity (3), the differential of $G$ becomes

$$dG = \mu\,dN - \sigma\,d\tau + V\,dp . \tag{5}$$

The differential (5) may be written as

$$dG = \left(\frac{\partial G}{\partial N}\right)_{\tau,p} dN + \left(\frac{\partial G}{\partial \tau}\right)_{N,p} d\tau + \left(\frac{\partial G}{\partial p}\right)_{N,\tau} dp . \tag{6}$$

Comparison of (5) and (6) gives the relations

$$(\partial G/\partial N)_{\tau,p} = \mu ; \tag{7}$$

$$(\partial G/\partial \tau)_{N,p} = -\sigma ; \tag{8}$$

$$(\partial G/\partial p)_{N,\tau} = V . \tag{9}$$


Three Maxwell relations may be obtained from these by cross-differentiation; 
see Problem 1. 

In the Gibbs free energy $G \equiv U - \tau\sigma + pV$ the variables $\tau$ and $p$ are intensive
quantities: they do not change value when two identical systems are put together.
But $U$, $\sigma$, $V$, and $G$ are linear in the number of particles $N$: their value doubles
when two identical systems are put together, apart from interface effects. We
say that $U$, $\sigma$, $V$, $N$ and $G$ are extensive quantities. Assume that only one particle
species is present. If $G$ is directly proportional to $N$, we must be able to write

$$G = N\varphi(p,\tau) , \tag{10}$$

where $\varphi$ is independent of $N$ because it is a function only of the intensive
quantities $p$ and $\tau$. If two identical volumes of gas at equal pressure and temperature,
each with $\tfrac{1}{2}N$ molecules, are put together, the Gibbs free energy

$$G = \tfrac{1}{2}N\varphi(p,\tau) + \tfrac{1}{2}N\varphi(p,\tau) = N\varphi(p,\tau)$$

does not change in the process. It follows from this argument that

$$(\partial G/\partial N)_{p,\tau} = \varphi(p,\tau) . \tag{11}$$

We saw in (7) that

$$(\partial G/\partial N)_{p,\tau} = \mu , \tag{12}$$

so that $\varphi$ must be identical with $\mu$, and (10) becomes

$$G(N,p,\tau) = N\mu(p,\tau) . \tag{13}$$

Thus the chemical potential for a single-component system is equal to the
Gibbs free energy per particle, $G/N$. For $G$ for an ideal gas, see (21) below.

If more than one chemical species is present, (13) is replaced by a sum over
all species:

$$G = \sum_j N_j\mu_j . \tag{14}$$

The thermodynamic identity becomes

$$\tau\,d\sigma = dU + p\,dV - \sum_j \mu_j\,dN_j ; \tag{15}$$

and (5) becomes

$$dG = \sum_j \mu_j\,dN_j - \sigma\,d\tau + V\,dp . \tag{16}$$

We shall develop the theory of chemical equilibria by exploiting the property
that $G = \sum_j N_j\mu_j$ is a minimum with respect to changes in the distribution of
reacting molecules at constant $\tau$, $p$. No new atoms come into the system in a
reaction; the atoms that are present redistribute themselves from one molecular
species to another molecular species.


Example: Comparison of G with F. Let us see what is different about the two relations

$$(\partial F/\partial N)_{\tau,V} = \mu(N,\tau,V) \tag{17}$$

and

$$(\partial G/\partial N)_{\tau,p} = \mu(\tau,p) . \tag{18}$$

We found in (6.18) that for an ideal gas

$$\mu(N,\tau,V) = \tau\log(N/Vn_Q) , \tag{19}$$

so that $\mu(N,\tau,V)$ is not independent of $N$ and therefore we cannot write $F = N\mu(\tau,V)$ as
the integral of (17).

That is, $F$ is not directly proportional to $N$ if the system is kept at constant volume as the
number of particles is increased. Instead, from (6.24),

$$F(\tau,V,N) = N\tau[\log(N/Vn_Q) - 1] . \tag{20}$$

But the Gibbs free energy for the ideal gas is

$$G(\tau,p,N) = F + pV = N\tau[\log(p/\tau n_Q) - 1] + N\tau = N\tau\log(p/\tau n_Q) , \tag{21}$$

by use of the ideal gas law in the form $N/V = p/\tau$. We readily identify in (21) the chemical
potential as

$$\mu(\tau,p) = \tau\log(p/\tau n_Q) , \tag{22}$$

by reference to the result $G = N\mu(\tau,p)$. We see that $N$ appears unavoidably in $\mu(\tau,V)$ in
(19), but not in $\mu(\tau,p)$ in (22). The chemical potential is the Gibbs free energy per particle,
but it is not the Helmholtz free energy per particle. Of course, we are free to write $\mu$ as either
(19) or (22), as is convenient.
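
The contrast between $F$ and $G$ for the ideal gas can be checked with numbers. The sketch below is an illustration only: it treats the quantum concentration as a fixed input, and the function names and sample values are assumptions made for the example.

```python
import math


def helmholtz_free_energy(n_particles, tau, volume, n_q):
    """F = N tau [log(N / (V n_Q)) - 1], the ideal gas form (20)."""
    return n_particles * tau * (math.log(n_particles / (volume * n_q)) - 1.0)


def chemical_potential(n_particles, tau, volume, n_q):
    """mu = tau log(N / (V n_Q)), the ideal gas form (19)."""
    return tau * math.log(n_particles / (volume * n_q))


def gibbs_free_energy(n_particles, tau, pressure, n_q):
    """G = N tau log(p / (tau n_Q)), the ideal gas form (21)."""
    return n_particles * tau * math.log(pressure / (tau * n_q))


# Assumed sample values in arbitrary consistent units.
tau, n_q = 1.0, 1.0e3
n_particles, volume = 50.0, 2.0
pressure = n_particles * tau / volume  # ideal gas law, N/V = p/tau

f = helmholtz_free_energy(n_particles, tau, volume, n_q)
g = gibbs_free_energy(n_particles, tau, pressure, n_q)
mu = chemical_potential(n_particles, tau, volume, n_q)

print(g - (f + pressure * volume))  # G = F + pV
print(g / n_particles - mu)         # G/N equals mu
print(f / n_particles - mu)         # F/N differs from mu by -tau

# Doubling N at fixed volume does not simply double F.
f_doubled = helmholtz_free_energy(2.0 * n_particles, tau, volume, n_q)
print(f_doubled - 2.0 * f)
```

The last printed difference is $2N\tau\log 2$ for these inputs, which is the extra free energy that comes from raising the concentration at fixed volume.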


EQUILIBRIUM IN REACTIONS 
We may write the equation of a chemical reaction as

$$\nu_1 A_1 + \nu_2 A_2 + \cdots + \nu_l A_l = 0 , \tag{23}$$

or

$$\sum_j \nu_j A_j = 0 , \tag{24}$$

where the $A_j$ denote the chemical species, and the $\nu_j$ are the coefficients of the
species in the reaction equation. Here $\nu$ is the Greek letter nu. For the reaction
$\mathrm{H_2 + Cl_2 = 2HCl}$ we have

$$A_1 = \mathrm{H_2} ; \quad A_2 = \mathrm{Cl_2} ; \quad A_3 = \mathrm{HCl} ; \quad \nu_1 = 1 ; \quad \nu_2 = 1 ; \quad \nu_3 = -2 . \tag{25}$$

The discussion of chemical equilibria is usually presented for reactions under
conditions of constant pressure and temperature. In equilibrium the Gibbs free
energy is a minimum with respect to changes in the proportions of the reactants.
The differential of $G$ is

$$dG = \sum_j \mu_j\,dN_j - \sigma\,d\tau + V\,dp . \tag{26}$$

Here $\mu_j$ is the chemical potential of species $j$, as defined by $\mu_j = (\partial G/\partial N_j)_{\tau,p}$. At
constant pressure $dp = 0$ and at constant temperature $d\tau = 0$; then (26) reduces
to

$$dG = \sum_j \mu_j\,dN_j . \tag{27}$$

The change in the Gibbs free energy in a reaction depends on the chemical
potentials of the reactants. In equilibrium $G$ is an extremum and $dG$ must be
zero.



The change $dN_j$ in the number of molecules of species $j$ is proportional to the
coefficient $\nu_j$ in the chemical equation $\sum_j \nu_j A_j = 0$. We may write $dN_j$ in the form

$$dN_j = \nu_j\,d\hat{N} , \tag{28}$$

where $d\hat{N}$ indicates how many times the reaction (24) takes place. The change $dG$
in (27) becomes

$$dG = \Big(\sum_j \nu_j\mu_j\Big)\,d\hat{N} . \tag{29}$$

In equilibrium $dG = 0$, so that

$$\sum_j \nu_j\mu_j = 0 . \tag{30}$$

This is the condition for equilibrium in a transformation of matter at constant
pressure and temperature.*


Equilibrium for Ideal Gases 


We obtain a simple and useful form of the general equilibrium condition
$\sum_j \nu_j\mu_j = 0$ when we assume that each of the constituents acts as an ideal gas.
We utilize (6.48) to write the chemical potential of species $j$ as

$$\mu_j = \tau(\log n_j - \log c_j) , \tag{31}$$

where $n_j$ is the concentration of species $j$ and

$$c_j \equiv n_{Qj}Z_j(\mathrm{int}) , \tag{32}$$

which depends on the temperature but not on the concentration. Here $Z_j(\mathrm{int})$
is the internal partition function, (6.44). Then (30) can be rearranged as

$$\sum_j \nu_j\log n_j = \sum_j \nu_j\log c_j , \tag{33a}$$

* But the result is more general: once equilibrium is reached, the reaction does not proceed further,
and there is no further change in the thermal average values of the concentrations. The volume at
equilibrium will be known, so that the condition (30) applies as well when $V$ and $\tau$ are specified as
when $p$ and $\tau$ are specified.
or as

$$\sum_j \log n_j^{\nu_j} = \sum_j \log c_j^{\nu_j} . \tag{33b}$$

The left-hand side can be rewritten as

$$\sum_j \log n_j^{\nu_j} = \log\prod_j n_j^{\nu_j} , \tag{33c}$$

and the right-hand side can be expressed as

$$\log\prod_j c_j^{\nu_j} \equiv \log K(\tau) . \tag{33d}$$

Here $K(\tau)$, called the equilibrium constant, is a function only of the temperature.
With (32) we have

$$K(\tau) \equiv \prod_j n_{Qj}^{\nu_j}\exp[-\nu_j F_j(\mathrm{int})/\tau] , \tag{34}$$

because the internal free energy is $F_j(\mathrm{int}) = -\tau\log Z_j(\mathrm{int})$. From (33c,d) and
(34) we have

$$\prod_j n_j^{\nu_j} = K(\tau) , \tag{35}$$

known as the law of mass action. The result says that the indicated product of the
concentrations of the reactants is a function of the temperature alone. A change
in the concentration of any one reactant will force a change in the equilibrium
concentration of one or more of the other reactants.
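
The step from the equilibrium condition (30) to the law of mass action (35) can be verified with a short numerical check. This is a sketch under stated assumptions: the values of $c_j$ are made-up numbers (real values require the quantum concentrations and internal partition functions), and the helper names are hypothetical.

```python
import math


def equilibrium_constant(nu, c):
    """K = product over j of c_j**nu_j, following (33d)."""
    return math.prod(c_j**nu_j for nu_j, c_j in zip(nu, c))


def concentration_product(nu, n):
    """Product over j of n_j**nu_j, the left-hand side of (35)."""
    return math.prod(n_j**nu_j for nu_j, n_j in zip(nu, n))


def sum_nu_mu(nu, n, c, tau):
    """Sum over j of nu_j*mu_j with mu_j = tau*(log n_j - log c_j), from (31)."""
    return sum(nu_j * tau * (math.log(n_j) - math.log(c_j))
               for nu_j, n_j, c_j in zip(nu, n, c))


# Reaction H2 + Cl2 = 2HCl written as in (25): nu = (1, 1, -2).
nu = (1, 1, -2)
c = (2.0, 3.0, 5.0)  # made-up values of c_j, arbitrary units
tau = 1.0            # temperature in energy units (assumed)

k_eq = equilibrium_constant(nu, c)

# Concentrations chosen to satisfy the law of mass action: 0.6*0.4/1.0**2 = 0.24.
n_equilibrium = (0.6, 0.4, 1.0)
# Concentrations that violate it.
n_off = (0.6, 0.4, 2.0)

print(k_eq, concentration_product(nu, n_equilibrium))
print(sum_nu_mu(nu, n_equilibrium, c, tau), sum_nu_mu(nu, n_off, c, tau))
```

When the concentration product equals $K$, the weighted sum of chemical potentials vanishes; for the off-equilibrium concentrations it does not.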

To calculate the equilibrium constant $K(\tau)$ in (34), it is essential to choose in
a consistent way the zero of the internal energy of each reactant. We need
consistency here because the value of each partition function $Z_j(\mathrm{int})$ depends
on our choice of the zero of the energy eigenstates. The different zeros for the
different reactants must be related to give properly the energy or free energy
difference in the reaction. It is not difficult to arrange this, but it does not
happen without a conscious effort on our part. For a dissociation reaction such
as $\mathrm{H_2 = 2H}$, the simplest procedure is to choose the zero of the internal energy
of each composite particle (here the $\mathrm{H_2}$ molecule) to coincide with the energy
of the dissociated particles (here 2H) at rest. Accordingly, we place the energy
of the ground state of the composite particle at $-E_B$, where $E_B$ is the energy
required in the reaction to dissociate the composite particle into its constituents
and is taken to be positive.


Example: Equilibrium of atomic and molecular hydrogen. The statement of the law of
mass action for the reaction $\mathrm{H_2 = 2H}$ or $\mathrm{H_2 - 2H = 0}$ for the dissociation of molecular
hydrogen into atomic hydrogen is

$$[\mathrm{H_2}][\mathrm{H}]^{-2} = K(\tau) . \tag{36}$$

Here $[\mathrm{H_2}]$ denotes the concentration of molecular hydrogen, and $[\mathrm{H}]$ the concentration
of atomic hydrogen. It follows that

$$\frac{[\mathrm{H}]}{[\mathrm{H_2}]} = \frac{1}{[\mathrm{H_2}]^{1/2}K^{1/2}} ; \tag{37}$$

that is, the relative concentration of atomic hydrogen at a given temperature is inversely
proportional to the square root of the concentration of molecular hydrogen. The equilibrium
constant $K$ is given by

$$\log K = \log n_Q(\mathrm{H_2}) - 2\log n_Q(\mathrm{H}) - F(\mathrm{H_2})/\tau , \tag{38}$$

in terms of the internal free energy of $\mathrm{H_2}$, per molecule. Spin factors are absorbed in $F(\mathrm{H_2})$.
Here the zero of energy is taken for an H atom at rest. The more tightly bound is $\mathrm{H_2}$, the
more negative is $F(\mathrm{H_2})$, and the higher is $K$, leading to a higher proportion of $\mathrm{H_2}$ in the
mixture. The energy to dissociate $\mathrm{H_2}$ is 4.476 eV per molecule, at absolute zero.

It may be said that the dissociation of molecular hydrogen into atomic hydrogen is an
example of entropy dissociation: The gain in entropy associated with the decomposition
of $\mathrm{H_2}$ into two independent particles compensates the loss in binding energy. It is believed
that most of the hydrogen in intergalactic space is present as H and not $\mathrm{H_2}$: The reaction
equilibrium is thrown in the direction of H by the low values of the concentration of $\mathrm{H_2}$.
Hydrogen is very dilute in intergalactic space.
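
The effect of dilution on the dissociation equilibrium can be made concrete by solving (36) together with conservation of hydrogen atoms. The sketch below assumes a fixed, made-up value of $K$ and works in arbitrary concentration units; the total atom concentration $n = [\mathrm{H}] + 2[\mathrm{H_2}]$ and the function name are introduced here for illustration.

```python
import math


def atomic_hydrogen_concentration(k_eq, n_total):
    """Solve [H2] = K [H]^2 with [H] + 2 [H2] = n_total for [H].

    The quadratic 2 K h^2 + h - n_total = 0 has the positive root
    h = 2 n_total / (1 + sqrt(1 + 8 K n_total)), a form that stays
    accurate when K * n_total is very small.
    """
    return 2.0 * n_total / (1.0 + math.sqrt(1.0 + 8.0 * k_eq * n_total))


k_eq = 1.0  # made-up equilibrium constant, arbitrary units
for n_total in (1.0, 1.0e-3, 1.0e-6):
    h = atomic_hydrogen_concentration(k_eq, n_total)
    h2 = k_eq * h**2
    print(n_total, h / n_total, h2 / h**2)
```

At fixed temperature the fraction of hydrogen present as atoms rises toward unity as the total concentration falls, which is the trend invoked above for very dilute hydrogen.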


Example: pH and the ionization of water. In liquid water the ionization process

$$\mathrm{H_2O} \rightleftharpoons \mathrm{H^+} + \mathrm{OH^-} \tag{39}$$

proceeds to a slight extent. At room temperature the reaction equilibrium is described
approximately by the concentration product

$$[\mathrm{H^+}][\mathrm{OH^-}] = 10^{-14}\ \mathrm{mol^2\,l^{-2}} , \tag{40}$$

where the ionic concentrations are given in moles per liter. In pure water $[\mathrm{H^+}] = [\mathrm{OH^-}] =
10^{-7}$ mol l$^{-1}$. An acid is said to act as a proton donor. The concentration of $\mathrm{H^+}$ ions is
increased by adding an acid to the water and the concentration of $\mathrm{OH^-}$ ions will decrease
as required to maintain the product $[\mathrm{H^+}][\mathrm{OH^-}]$ constant. Similarly, the concentration
of $\mathrm{OH^-}$ ions can be increased by adding a base to the water, and the $\mathrm{H^+}$ concentration will
decrease accordingly. The physical state of water is more complicated than the equation
of the ionization process suggests: the $\mathrm{H^+}$ ions are not bare protons, but are associated
with groups* of $\mathrm{H_2O}$ molecules. This does not significantly affect the validity of the reaction
equation.

It is often convenient to express the acidity or alkalinity of a solution in terms of the pH,
defined as

$$\mathrm{pH} \equiv -\log_{10}[\mathrm{H^+}] . \tag{41}$$

The pH of a solution is the negative of the logarithm base ten of the hydrogen ion concentration
in moles per liter of solution. The pH of pure water is 7 because $[\mathrm{H^+}] = 10^{-7}$ mol l$^{-1}$.
The strongest acidic solutions have pH near 0 or even negative; an apple may have pH ≈ 3.
Human blood plasma has a pH of 7.3 to 7.5; it is slightly basic.
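
The definitions (40) and (41) translate directly into a small calculation. The sketch below is illustrative: the function names are hypothetical, and the room-temperature ion product $10^{-14}$ mol$^2$ l$^{-2}$ is the approximate value quoted in (40).

```python
import math

ION_PRODUCT = 1.0e-14  # [H+][OH-] in mol^2 l^-2 at room temperature, from (40)


def ph_from_hydrogen_concentration(h_conc):
    """pH = -log10 [H+], with [H+] in moles per liter, as in (41)."""
    return -math.log10(h_conc)


def hydroxide_concentration(h_conc):
    """[OH-] in moles per liter required to keep [H+][OH-] fixed."""
    return ION_PRODUCT / h_conc


for h_conc in (1.0e-7, 1.0e-3):  # pure water, and an acidic solution with pH 3
    print(h_conc, ph_from_hydrogen_concentration(h_conc),
          hydroxide_concentration(h_conc))
```

Raising the hydrogen ion concentration by four orders of magnitude lowers the pH by four units and lowers the hydroxide concentration by the same four orders of magnitude.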


Example: Kinetic model of mass action. Suppose that atoms A and B combine to form a
molecule AB. We suppose that AB is formed in a biatomic collision of A and B. Let $n_A$,
$n_B$, $n_{AB}$ denote the concentrations of A, B, and AB respectively. The rate of change of $n_{AB}$ is

$$dn_{AB}/dt = Cn_An_B - Dn_{AB} , \tag{42}$$

where the rate constant $C$ describes the formation of AB in a collision of A with B, and the
rate constant $D$ describes the reverse process, the thermal decay of AB into its component
atoms A and B. In thermal equilibrium the concentrations of all constituents are constant,
so that $dn_{AB}/dt = 0$ and

$$Cn_An_B = Dn_{AB} ; \qquad n_An_B/n_{AB} = D/C , \tag{43}$$

a function of temperature only. This result is consistent with the law of mass action that we
derived earlier by standard thermodynamics.
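
The approach to the equilibrium ratio (43) can be followed by integrating the rate equation (42) in time. The sketch below is a simple forward-Euler integration with made-up rate constants and initial concentrations; the function name, step size, and number of steps are assumptions chosen only so that the example converges.

```python
def relax_to_equilibrium(n_a, n_b, n_ab, c_rate, d_rate, dt=1.0e-3, steps=20_000):
    """Integrate dn_AB/dt = C n_A n_B - D n_AB with forward Euler steps.

    Each AB molecule formed removes one A atom and one B atom, so n_A and
    n_B decrease by the same amount that n_AB increases.
    """
    for _ in range(steps):
        rate = c_rate * n_a * n_b - d_rate * n_ab
        n_ab += rate * dt
        n_a -= rate * dt
        n_b -= rate * dt
    return n_a, n_b, n_ab


c_rate, d_rate = 2.0, 1.0  # made-up rate constants, arbitrary units
n_a, n_b, n_ab = relax_to_equilibrium(1.0, 1.0, 0.0, c_rate, d_rate)
print(n_a * n_b / n_ab, d_rate / c_rate)
```

Starting from a different initial mixture with the same rate constants changes the individual concentrations at equilibrium but not the ratio $n_An_B/n_{AB}$.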

Suppose AB is not formed principally by the bimolecular collision of A and B, but is 
formed by some catalytic process such as 


$$\mathrm{A + E \rightleftharpoons AE} ; \qquad \mathrm{AE + B \rightleftharpoons AB + E} . \tag{44}$$


* The dominant species present is most likely $\mathrm{H^+ \cdot 4H_2O}$, a complex of 4 water molecules
surrounding one proton. A review is given by M. Eigen and L. De Maeyer, Proc. Roy. Soc. (London) A247,
505 (1958).
Here E is the catalyst which is returned to its original state at the end of the second step.
So long as the intermediate product AE is so short lived that no significant quantity of A
is tied up as AE, the ratio $n_An_B/n_{AB}$ in equilibrium must be the same as if AB were formed in
the direct process $\mathrm{A + B \rightleftharpoons AB}$ treated above. No matter by what route the reaction
actually proceeds, the equilibrium must be the same. The rates, however, may differ.

The equality in equilibrium of the direct and inverse reaction rates is called the principle 
of detailed balance. 


Comment: Reaction rates. The law of mass action expresses the condition satisfied by the 
concentrations once a reaction has gone to equilibrium. It tells us nothing about how fast 
the reaction proceeds. A reaction A + B = C may evolve energy $\Delta H$ as it proceeds, but
before the reaction can occur A and B may have to negotiate a potential barrier, as in 
Figure 9.2. The barrier height is called the activation energy. Only molecules on the high 
energy end of their energy distribution will be able to react; others will not be able to get 
over the potential hill. A catalyst speeds up a reaction by offering an alternate reaction path 
with a lower energy of activation, but it does not change the equilibrium concentrations. 


[Figure 9.2: potential energy plotted against a schematic reaction coordinate, with the activation energy marked.]

Figure 9.2. The quantity $\Delta H$ measures the energy evolved in the reaction
and determines the equilibrium concentration ratio [A][B]/[C]. The
activation energy is the height of the potential barrier to be negotiated
before the reaction can proceed, and it determines the rate at which the
reaction takes place.
SUMMARY

1. The Gibbs free energy

$$G \equiv U - \tau\sigma + pV$$

is a minimum in thermal equilibrium at constant temperature and pressure.

2. $(\partial G/\partial\tau)_{N,p} = -\sigma$; $(\partial G/\partial p)_{N,\tau} = V$; $(\partial G/\partial N)_{\tau,p} = \mu$.

3. $G(\tau,p,N) = N\mu(\tau,p)$.

4. The law of mass action for a chemical reaction is that

$$\prod_j n_j^{\nu_j} = K(\tau),$$

a function of the temperature alone.


PROBLEMS 


1. Thermal expansion near absolute zero. (a) Prove the three Maxwell relations

$$(\partial V/\partial\tau)_p = -(\partial\sigma/\partial p)_\tau, \tag{45a}$$
$$(\partial V/\partial N)_p = +(\partial\mu/\partial p)_N, \tag{45b}$$
$$(\partial\mu/\partial\tau)_N = -(\partial\sigma/\partial N)_\tau. \tag{45c}$$

Strictly speaking, (45a) should be written

$$(\partial V/\partial\tau)_{p,N} = -(\partial\sigma/\partial p)_{\tau,N},$$

and two subscripts should appear similarly in (45b) and (45c). It is common to
omit those subscripts that occur on both sides of these equalities. (b) Show with
the help of (45a) and the third law of thermodynamics that the volume coefficient
of thermal expansion

$$\alpha = \frac{1}{V}\left(\frac{\partial V}{\partial\tau}\right)_p \tag{46}$$

approaches zero as $\tau \to 0$.




2. Thermal ionization of hydrogen. Consider the formation of atomic hydrogen
in the reaction $e + \mathrm{H^+} \rightleftharpoons \mathrm{H}$, where e is an electron, as the adsorption
of an electron on a proton $\mathrm{H^+}$. (a) Show that the equilibrium concentrations
of the reactants satisfy the relation


$$[e][\mathrm{H^+}]/[\mathrm{H}] \simeq n_Q \exp(-I/\tau), \tag{47}$$


where $I$ is the energy required to ionize atomic hydrogen, and $n_Q \equiv (m\tau/2\pi\hbar^2)^{3/2}$
refers to the electron. Neglect the spins of the particles; this assumption does
not affect the final result. The result is known as the Saha equation. If all the
electrons and protons arise from the ionization of hydrogen atoms, then the
concentration of protons is equal to that of the electrons, and the electron
concentration is given by


$$[e] = [\mathrm{H}]^{1/2}\, n_Q^{1/2} \exp(-I/2\tau). \tag{48}$$


A similar problem arises in semiconductor physics in connection with the 
thermal ionization of impurity atoms that are donors of electrons. 
Notice that: 


(1) The exponent involves $\tfrac{1}{2}I$ and not $I$, which shows that this is not a simple
“Boltzmann factor” problem. Here $I$ is the ionization energy.

(2) The electron concentration is proportional to the square root of the 
hydrogen atom concentration. 

(3) If we add excess electrons to the system, then the concentration of protons 
will decrease. 


(b) Let [H(exc)] denote the equilibrium concentration of H atoms in the
first excited electronic state, which is $\tfrac{3}{4}I$ above the ground state. Compare
[H(exc)] with [e] for conditions at the surface of the Sun, with $[\mathrm{H}] \simeq 10^{23}\,\mathrm{cm^{-3}}$
and $T \simeq 5000$ K.
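
For readers who want to check orders of magnitude, (48) is easy to evaluate numerically. This is a minimal sketch, not a worked solution: the function names are hypothetical, the physical constants are standard SI values, the ionization energy of 13.6 eV for hydrogen is supplied here rather than by the problem statement, and the hydrogen concentration is passed in as a parameter.

```python
import math

K_B = 1.380649e-23        # Boltzmann constant, J/K
HBAR = 1.054571817e-34    # reduced Planck constant, J s
M_E = 9.1093837015e-31    # electron mass, kg
EV = 1.602176634e-19      # one electron volt, J


def quantum_concentration_cm3(mass_kg, T):
    """n_Q = (m*tau / (2*pi*hbar^2))^(3/2), converted from m^-3 to cm^-3."""
    tau = K_B * T
    n_q_m3 = (mass_kg * tau / (2.0 * math.pi * HBAR**2)) ** 1.5
    return n_q_m3 * 1e-6


def saha_electron_concentration_cm3(n_h_cm3, T, ionization_ev=13.6):
    """Electron concentration from (48): [e] = sqrt([H] n_Q) exp(-I / 2 tau)."""
    tau_ev = K_B * T / EV
    n_q = quantum_concentration_cm3(M_E, T)
    return math.sqrt(n_h_cm3 * n_q) * math.exp(-ionization_ev / (2.0 * tau_ev))


if __name__ == "__main__":
    print(saha_electron_concentration_cm3(1e23, 5000.0), "cm^-3")
```

Because [e] scales as the square root of [H], doubling the hydrogen concentration raises the electron concentration only by a factor $\sqrt{2}$, which is note (2) above in numerical form.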


3. Ionization of donor impurities in semiconductors. A pentavalent impurity
(called a donor) introduced in place of a tetravalent silicon atom in crystalline
silicon acts like a hydrogen atom in free space, but with $e^2/\epsilon$ playing the role of
$e^2$ and an effective mass $m^*$ playing the role of the electron mass $m$ in the
description of the ionization energy and radius of the ground state of the
impurity atom, and also for the free electron. For silicon the dielectric constant
$\epsilon = 11.7$ and, approximately, $m^* = 0.3\,m$. If there are $10^{17}$ donors per cm$^3$,
estimate the concentration of conduction electrons at 100 K.


4. Biopolymer growth. Consider the chemical equilibrium of a solution
of linear polymers made up of identical units. The basic reaction step is
monomer + Nmer = (N + 1)mer. Let $K_N$ denote the equilibrium constant for
this reaction. (a) Show from the law of mass action that the concentrations $[\cdots]$
satisfy


$$[N + 1] = [1]^{N+1}/K_1 K_2 K_3 \cdots K_N. \tag{49}$$


(b) Show from the theory of reactions that for ideal gas conditions (an ideal
solution):


$$K_N = \frac{n_Q(N)\, n_Q(1)}{n_Q(N+1)} \exp[(F_{N+1} - F_N - F_1)/\tau]. \tag{50}$$

Here

$$n_Q(N) = (2\pi\hbar^2/M_N\tau)^{-3/2}, \tag{51}$$


where $M_N$ is the mass of the Nmer molecule, and $F_N$ is the free energy of one
Nmer molecule. (c) Assume $N \gg 1$, so that $n_Q(N) \simeq n_Q(N + 1)$. Find the
concentration ratio $[N + 1]/[N]$ at room temperature if there is zero free
energy change in the basic reaction step: that is, if $\Delta F = F_{N+1} - F_N - F_1 = 0$.
Assume $[1] = 10^{20}\,\mathrm{cm^{-3}}$, as for amino acid molecules in a bacterial cell. The
molecular weight of the monomer is 200. (d) Show that for the reaction to go in
the direction of long molecules we need $\Delta F < -0.4$ eV, approximately. This
condition is not satisfied in Nature, but an ingenious pathway is followed that
simulates the condition. An elementary discussion is given by C. Kittel, Am. J.
Phys. 40, 60 (1972).


5. Particle-antiparticle equilibrium. (a) Find a quantitative expression for the
thermal equilibrium concentration $n = n^+ = n^-$ in the particle-antiparticle
reaction $A^+ + A^- = 0$. The reactants may be electrons and positrons; protons
and antiprotons; or electrons and holes in a semiconductor. Let the mass of
either particle be $M$; neglect the spins of the particles. The minimum energy
release when $A^+$ combines with $A^-$ is $\Delta$. Take the zero of the energy scale as the
energy with no particles present. (b) Estimate $n$ in cm$^{-3}$ for an electron (or a
hole) in a semiconductor at $T = 300$ K with a $\Delta$ such that $\Delta/\tau = 20$. The hole is
viewed as the antiparticle to the electron. Assume that the electron concentration
is equal to the hole concentration; assume also that the particles are in the
classical regime. (c) Correct the result of (a) to let each particle have a spin of $\tfrac{1}{2}$.
Particles that have antiparticles are usually fermions with spins of $\tfrac{1}{2}$.
VAPOR PRESSURE EQUATION


The curve of pressure versus volume for a quantity of matter at constant 
temperature is determined by the free energy of the substance. The curve is 
called an isotherm. We consider the isotherms of a real gas in which the atoms 
or molecules interact with one another and under appropriate conditions can 
associate together in a liquid or solid phase. A phase is a portion of a system 
that is uniform in composition. 

Two phases may coexist, with a definite boundary between them. An isotherm
of a real gas may show a region in the p-V plane in which liquid and gas coexist
in equilibrium with each other. As in Figure 10.1, part of the volume contains 
atoms in the gas phase. There are isotherms at low temperatures for which 
solid and liquid coexist and isotherms for which solid and gas coexist. Everything 
we say for the liquid-gas equilibrium holds also for the solid-gas equilibrium 
and the solid-liquid equilibrium. 

Liquid and vapor* may coexist on a section of an isotherm only if the
temperature of the isotherm lies below a critical temperature $\tau_c$. Above the
critical temperature only a single phase—the fluid phase—exists, no matter 
how great the pressure. There is no more reason to call this phase a gas than 
a liquid, so we avoid the issue and call it a fluid. Values of the critical temperature 
for several gases are given in Table 10.1. 

Liquid and gas will never coexist along the entire extent of an isotherm 
from zero pressure to infinite pressure; they coexist at most only along a 
section of the isotherm. For a fixed temperature and fixed number of atoms, 
there will be a volume above which all atoms present are in the gas phase. 
A small drop of water placed in an evacuated sealed bell jar at room temperature 
will evaporate entirely, leaving the bell jar filled with $\mathrm{H_2O}$ gas at some pressure.
A drop of water exposed to air not already saturated with moisture may 
evaporate entirely. There is a concentration of water, however, above which 
the atoms from the vapor will bind themselves into a liquid drop. The volume 
relations are suggested by Figure 10.1. 

The thermodynamic conditions for the coexistence of two phases are the 
conditions for the equilibrium of two systems that are in thermal, diffusive, 


* Vapor is a term used for a gas when the gas is in equilibrium with its liquid or solid form. 




[Figure 10.1: pressure versus volume for a curve taken at constant temperature, with regions labeled Liquid, Liquid + gas, and Gas.]

Figure 10.1 Pressure-volume isotherm of a real gas at a
temperature such that liquid and gas phases may coexist, that
is, $\tau < \tau_c$. In the two-phase region of liquid + gas the pressure
is constant, but the volume may change. At a given temperature
there is only a single value of the pressure for which a liquid
and its vapor are in equilibrium. If at this pressure we move the
piston down, some of the gas is condensed to liquid, but the
pressure remains unchanged as long as any gas remains.


Table 10.1 Critical temperatures of gases

Gas    T_c, in K      Gas    T_c, in K
He        5.2         H2        33.2
Ne       44.4         N2       126.0
Ar      151           O2       154.3
Kr      210           H2O      647.1
and mechanical contact. These conditions are that $\tau_1 = \tau_2$; $\mu_1 = \mu_2$; $p_1 = p_2$
or, for liquid and gas,


$$\tau_l = \tau_g; \qquad \mu_l = \mu_g; \qquad p_l = p_g, \tag{1}$$


where the subscripts $l$ and $g$ denote the liquid and gas phases. Note that the
chemical potentials of the same chemical species in the two phases must be 
equal if the phases coexist. The chemical potentials are evaluated at the common 
pressure and common temperature of the liquid and gas, so that 


$$\mu_l(p,\tau) = \mu_g(p,\tau). \tag{2}$$


At a general point in the p-$\tau$ plane the two phases do not coexist: If $\mu_l < \mu_g$
the liquid phase alone is stable, and if $\mu_g < \mu_l$ the gas phase alone is stable.
Metastable phases may occur, by supercooling or superheating. A metastable 
phase may have a transient existence, sometimes brief, sometimes long, at a 
temperature for which another and more stable phase of the same substance 
has a lower chemical potential. 


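Derivation of the Coexistence Curve, p Versus $\tau$


Let $p_0$ be the pressure for which two phases, liquid and gas, coexist at the
temperature $\tau_0$. Suppose that the two phases also coexist at the nearby point
$p_0 + dp$; $\tau_0 + d\tau$. The curve in the p, $\tau$ plane along which the two phases coexist
divides the p, $\tau$ plane into a phase diagram, as given in Figure 10.2 for $\mathrm{H_2O}$.
It is a condition of coexistence that


$$\mu_g(p_0,\tau_0) = \mu_l(p_0,\tau_0), \tag{3}$$

and also that


$$\mu_g(p_0 + dp, \tau_0 + d\tau) = \mu_l(p_0 + dp, \tau_0 + d\tau). \tag{4}$$


Equations (3) and (4) give a relationship between $dp$ and $d\tau$.
We make a series expansion of each side of (4) to obtain


$$\mu_g(p_0,\tau_0) + \left(\frac{\partial\mu_g}{\partial p}\right)_\tau dp + \left(\frac{\partial\mu_g}{\partial\tau}\right)_p d\tau + \cdots = \mu_l(p_0,\tau_0) + \left(\frac{\partial\mu_l}{\partial p}\right)_\tau dp + \left(\frac{\partial\mu_l}{\partial\tau}\right)_p d\tau + \cdots. \tag{5}$$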
[Figure 10.2: pressure versus temperature, in °C, with regions for the solid, liquid, and gas phases.]

Figure 10.2. Phase diagram of $\mathrm{H_2O}$. The relationships of the chemical potentials $\mu_s$, $\mu_l$,
and $\mu_g$ in the solid, liquid, and gas phases are shown. The phase boundary here between ice
and water is not exactly vertical; the slope is actually negative, although very large. After
International Critical Tables, Vol. 3, and P. W. Bridgman, Proc. Am. Acad. Sci. 47, 441 (1912);
for the several forms of ice, see Zemansky, p. 375.



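In the limit as $dp$ and $d\tau$ approach zero,


$$\left(\frac{\partial\mu_g}{\partial p}\right)_\tau dp + \left(\frac{\partial\mu_g}{\partial\tau}\right)_p d\tau = \left(\frac{\partial\mu_l}{\partial p}\right)_\tau dp + \left(\frac{\partial\mu_l}{\partial\tau}\right)_p d\tau, \tag{6}$$


by (3) and (5). This result may be rearranged to give


$$\frac{dp}{d\tau} = \frac{\left(\dfrac{\partial\mu_l}{\partial\tau}\right)_p - \left(\dfrac{\partial\mu_g}{\partial\tau}\right)_p}{\left(\dfrac{\partial\mu_g}{\partial p}\right)_\tau - \left(\dfrac{\partial\mu_l}{\partial p}\right)_\tau}, \tag{7}$$
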
which is the differential equation of the coexistence curve or vapor pressure 
curve. 


The derivatives of the chemical potential which occur in (7) may be expressed 
in terms of quantities accessible to measurement. In the treatment of the Gibbs 
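free energy in Chapter 9 we found the relations


$$G = N\mu(p,\tau); \qquad \left(\frac{\partial G}{\partial p}\right)_{N,\tau} = V; \qquad \left(\frac{\partial G}{\partial\tau}\right)_{N,p} = -\sigma. \tag{8}$$


With the definitions

$$v \equiv V/N, \qquad s \equiv \sigma/N \tag{9}$$


for the volume and entropy per molecule in each phase, we have


$$\frac{1}{N}\left(\frac{\partial G}{\partial p}\right)_{N,\tau} = \frac{V}{N} = v = \left(\frac{\partial\mu}{\partial p}\right)_\tau; \qquad \frac{1}{N}\left(\frac{\partial G}{\partial\tau}\right)_{N,p} = -\frac{\sigma}{N} = -s = \left(\frac{\partial\mu}{\partial\tau}\right)_p. \tag{10}$$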


Then (7) for $dp/d\tau$ becomes


$$\frac{dp}{d\tau} = \frac{s_g - s_l}{v_g - v_l}. \tag{11}$$


Here $s_g - s_l$ is the increase of entropy of the system when we transfer one
molecule from the liquid to the gas, and $v_g - v_l$ is the increase of volume
of the system when we transfer one molecule from the liquid to the gas.

It is essential to understand that the derivative $dp/d\tau$ in (11) is not simply
taken from the equation of state of the gas. The derivative refers to the very
special interdependent change of $p$ and $\tau$ in which the gas and liquid continue
to coexist. The number of molecules in each phase will vary as the volume is
varied, subject only to $N_l + N_g = N$, a constant. Here $N_l$ and $N_g$ are the
numbers of molecules in the liquid and gas phases, respectively.

The quantity $s_g - s_l$ is related directly to the quantity of heat that must
be added to the system to transfer one molecule reversibly from the liquid
to the gas, while keeping the temperature of the system constant. (If heat is
not added to the system from outside in the process, the temperature will
decrease when the molecule is transferred to the gas.) The quantity of heat
added in the transfer is

$$dQ = \tau(s_g - s_l), \tag{12}$$

by virtue of the connection between heat and the change of entropy in a
reversible process. The quantity


$$L \equiv \tau(s_g - s_l) \tag{13}$$

defines the latent heat of vaporization, and is easily measured by elementary
calorimetry.
We let


$$\Delta v \equiv v_g - v_l \tag{14}$$


denote the change of volume when one molecule is transferred from the liquid
to the gas. We combine (11), (13), and (14) to obtain


$$\frac{dp}{d\tau} = \frac{L}{\tau\,\Delta v}. \tag{15}$$

This is known as the Clausius-Clapeyron equation or the vapor pressure equation. 
The derivation of this equation was a remarkable early accomplishment of 
thermodynamics. Both sides of (15) are easily determined experimentally, and 
the equation has been verified to high precision. 
We obtain a particularly useful form of (15) if we make two approximations:
(a) We assume that $v_g \gg v_l$: the volume occupied by an atom in the gas
phase is very much larger than in the liquid (or solid) phase, so that we may
replace $\Delta v$ by $v_g$:

$$\Delta v \simeq v_g = V_g/N_g. \tag{16}$$

At atmospheric pressure $v_g/v_l \approx 10^3$, and the approximation is very good.
(b) We assume that the ideal gas law $pV_g = N_g\tau$ applies to the gas phase,
so that (16) may be written as


$$\Delta v \simeq \tau/p. \tag{17}$$


With these approximations the vapor pressure equation becomes


$$\frac{d}{d\tau}\log p = \frac{L}{\tau^2}, \tag{18}$$


where $L$ is the latent heat per molecule. Given $L$ as a function of temperature,
this equation may be integrated to find the coexistence curve.
If, in addition, the latent heat $L$ is independent of temperature over the
temperature range of interest, we may take $L = L_0$ outside the integral. Thus
when we integrate (18) we obtain


$$\int \frac{dp}{p} = L_0 \int \frac{d\tau}{\tau^2}, \tag{19}$$
Figure 10.3. Vapor pressure of water and of ice
plotted versus 1/T. The vertical scale is
logarithmic. The dashed line is a straight line.
whence


$$\log p = -L_0/\tau + \text{constant}; \qquad p(\tau) = p_0 \exp(-L_0/\tau), \tag{20}$$


where $p_0$ is a constant. We defined $L_0$ as the latent heat of vaporization of
one molecule. If $L_0$ refers instead to one mole, then


$$p(T) = p_0 \exp(-L_0/RT), \tag{21}$$


where $R$ is the gas constant, $R \equiv N_0 k_B$, where $N_0$ is the Avogadro constant. For
water the latent heat at the liquid-gas transition is 2485 J g$^{-1}$ at 0°C and 2260
J g$^{-1}$ at 100°C, a substantial variation with temperature.
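
The approximate results (18) and (21) can be tried on water near its normal boiling point. The sketch below is only an illustration under stated assumptions: the function names are hypothetical, the molar mass of water (18.015 g/mol) and the reference point (101325 Pa at 373.15 K) are supplied here and are not from the text, and $L_0$ is held constant at the 100°C value quoted above.

```python
import math

R = 8.314462618              # gas constant, J/(mol K)
L_MOLAR = 2260.0 * 18.015    # J/mol: 2260 J/g times an assumed molar mass of water


def slope_dp_dT(T, p, latent_molar=L_MOLAR):
    """Slope of the coexistence curve from (18) in molar form: dp/dT = L p / (R T^2)."""
    return latent_molar * p / (R * T**2)


def vapor_pressure(T, T_ref=373.15, p_ref=101325.0, latent_molar=L_MOLAR):
    """Integrated form (21), with the constant p0 fixed by a reference point."""
    return p_ref * math.exp(-(latent_molar / R) * (1.0 / T - 1.0 / T_ref))


if __name__ == "__main__":
    print("dp/dT at 100 C:", slope_dp_dT(373.15, 101325.0), "Pa/K")
    print("p at 90 C:", vapor_pressure(363.15), "Pa")
```

Fixing $p_0$ through a measured reference point avoids having to know the constant of integration in (20). Over wider temperature ranges the variation of the latent heat noted above makes the constant-$L_0$ estimate progressively less accurate.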

The vapor pressure of water and of ice is plotted in Figure 10.3 as $\log p$
versus 1/T. The curve is linear over substantial regions, consistent with the 
[Figure 10.4: vapor pressure versus temperature, in K. Annotation on the plot: Along this curve the liquid and vapor are in equilibrium and coexist. Anywhere off the curve the system is all liquid or all vapor. At much higher pressures a solid phase occurs.]

Figure 10.4 Vapor pressure versus temperature for $^4$He. After H. van Dijk
et al., Journal of Research of the National Bureau of Standards 63A, 12
(1959).


approximate result (20). The vapor pressure of $^4$He, plotted in Figure 10.4,
is widely used in the measurement of temperatures between 1 and 5 K.

The phase diagram of $^4$He at low temperatures was shown in Figure 7.14.
Notice that the liquid-solid coexistence curve is closely horizontal below 1.4 K.
We infer from this and (11) that the entropy of the liquid is very nearly equal
to the entropy of the solid in this region. It is remarkable that the entropies
should be so similar, because a normal liquid is much more disordered than a
solid, so that the entropy of a normal liquid is considerably higher than that
of a normal solid. But $^4$He is a quantum liquid. For another quantum liquid,
$^3$He, the slope of the liquid-solid curve is negative at low temperatures
(Figure 7.15), and in this region the entropy of the liquid is less than the entropy
of the solid. The solid has more accessible states than the liquid! Liquid $^3$He
has a relatively low entropy for a liquid because it approximates a Fermi gas,
which generally has a low entropy when $\tau \ll \tau_F$ because a large proportion
of the atoms have their momenta ordered into the Fermi sphere of Chapter 7.


Triple point. The triple point of a substance is that point $p_t$, $\tau_t$ in the p-$\tau$
plane at which all three phases, vapor, liquid, and solid, are in equilibrium.
Here $\mu_g = \mu_l = \mu_s$. Consider an equilibrium mixture of liquid and solid phases
enclosed in a volume somewhat larger than that occupied by the mixture 
alone. The remaining volume will contain only the vapor, in equilibrium with 
both condensed phases, and at a pressure equal to the common equilibrium 
vapor pressure of both phases. This pressure is the triple point pressure. 

The triple point temperature is not identical with the melting temperature 
of the substance at atmospheric pressure. Melting temperatures depend 
somewhat on pressure; the triple point temperature is the melting temperature 
under the common equilibrium vapor pressure of the two condensed phases. 

For water the triple point temperature is 0.01K above the atmospheric 
pressure melting temperature: T, = 0.01°C = 273.16K. The Kelvin scale is 
defined such that the triple point of water is exactly 273.16 K; see Appendix B. 


Latent heat and enthalpy. The latent heat of a phase transformation, as from
the liquid phase to the gas phase, is equal to $\tau$ times the entropy difference
of the two phases at constant pressure. The latent heat is also equal to the
difference of $H \equiv U + pV$ between the two phases, where $H$ is called the
enthalpy. The differential is $dH = dU + p\,dV + V\,dp$. When we cross the
coexistence curve, the thermodynamic identity applies:


$$\tau\,d\sigma = dU + p\,dV - (\mu_g - \mu_l)\,dN_g. \tag{22}$$

On the coexistence curve $\mu_g = \mu_l$. Thus at constant pressure

$$L = \tau\,\Delta\sigma = \Delta U + p\,\Delta V = \Delta H = H_g - H_l. \tag{23}$$


Values of $H$ are tabulated; they are found by integration of the heat capacity
at constant pressure:

$$C_p = \tau\left(\frac{\partial\sigma}{\partial\tau}\right)_p = \left(\frac{\partial H}{\partial\tau}\right)_p, \tag{24}$$

or


$$H = \int C_p\,d\tau. \tag{25}$$


Example: Model system for gas-solid equilibrium. We construct a simple model to
describe a solid in equilibrium with a gas, as in Figure 10.5. We can easily derive the vapor
pressure curve for this model. Roughly the same model would apply to a liquid.

Imagine the solid to consist of N atoms, each bound as a harmonic oscillator of
frequency $\omega$ to a fixed center of force. The binding energy of each atom in the ground state is
$\varepsilon_0$; that is, the energy of an atom in its ground state is $-\varepsilon_0$ referred to a free atom at rest.
The energy states of a single oscillator are $n\hbar\omega - \varepsilon_0$, where $n$ is a positive integer or zero
(Figure 10.6). For the sake of simplicity we suppose that each atom can oscillate only in one
dimension. The result for oscillators in three dimensions is left as a problem.

The partition function of a single oscillator in the solid is


$$Z_s = \sum_n \exp[-(n\hbar\omega - \varepsilon_0)/\tau] = \exp(\varepsilon_0/\tau) \sum_n \exp(-n\hbar\omega/\tau) = \frac{\exp(\varepsilon_0/\tau)}{1 - \exp(-\hbar\omega/\tau)}. \tag{26}$$

The free energy $F_s$ is

$$F_s = U_s - \tau\sigma_s = -\tau\log Z_s. \tag{27}$$

The Gibbs free energy in the solid is, per atom,

$$G_s = U_s - \tau\sigma_s + pv_s = F_s + pv_s = \mu_s. \tag{28}$$


Figure 10.5 Atoms in a solid in equilibrium
with atoms in the gas phase. The equilibrium
pressure is a function of temperature. The
energy of the atoms in the solid phase is lower
than in the gas phase, but the entropy of the
atoms tends to be higher in the gas phase. The
equilibrium configuration is determined by the
counterplay of the two effects. At low
temperature most of the atoms are in the solid;
at high temperature all or most of the atoms
may be in the gas.

Figure 10.6 States of an atom bound as a
harmonic oscillator of frequency $\omega$. The
ground state is assumed to be $\varepsilon_0$ below that of a
free atom at rest in the gas phase.

[Energy-level diagram: levels at $-\varepsilon_0$ (ground state of bound atom), $\hbar\omega - \varepsilon_0$, $2\hbar\omega - \varepsilon_0$, and $3\hbar\omega - \varepsilon_0$.]


The pressure in the solid is equal to that of the gas with which it is in contact, but the
volume $v_s$ per atom in the solid phase is much smaller than the volume $v_g$ per atom in the
gas phase: $v_s \ll v_g$.

If we neglect the term $pv_s$ we have for the chemical potential of the solid $\mu_s \simeq F_s$, whence
the absolute activity is


$$\lambda_s \equiv \exp(\mu_s/\tau) = \exp(F_s/\tau) = \exp(-\log Z_s) = \frac{1}{Z_s} = \exp(-\varepsilon_0/\tau)[1 - \exp(-\hbar\omega/\tau)]. \tag{29}$$


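We make the ideal gas approximation to describe the gas phase, and we take the spin of
the atom to be zero. Then, from Chapter 6,


$$\lambda_g = \frac{n}{n_Q} = \frac{p}{\tau n_Q} = \frac{p}{\tau}\left(\frac{2\pi\hbar^2}{M\tau}\right)^{3/2}. \tag{30}$$


The gas is in equilibrium with the solid when $\lambda_g = \lambda_s$, or

$$p = \tau n_Q \exp(-\varepsilon_0/\tau)[1 - \exp(-\hbar\omega/\tau)]. \tag{31}$$


If we insert $n_Q$ from (3.63):


$$p = \left(\frac{M}{2\pi\hbar^2}\right)^{3/2} \tau^{5/2} \exp(-\varepsilon_0/\tau)[1 - \exp(-\hbar\omega/\tau)]. \tag{32}$$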
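
The temperature dependence in (32) can be explored in reduced units. The following is a minimal sketch: energies and $\tau$ are measured in units of $\hbar\omega$ and the pressure in units of $(M/2\pi\hbar^2)^{3/2}(\hbar\omega)^{5/2}$, a choice made here for convenience; the function name and the sample value $\varepsilon_0 = 10\,\hbar\omega$ are illustrative assumptions.

```python
import math


def model_vapor_pressure(tau, eps0):
    """Reduced form of (32): p = tau^(5/2) exp(-eps0/tau) [1 - exp(-1/tau)].

    tau and eps0 are in units of hbar*omega; the returned pressure is in
    units of (M / 2 pi hbar^2)^(3/2) (hbar*omega)^(5/2).
    """
    return tau**2.5 * math.exp(-eps0 / tau) * (1.0 - math.exp(-1.0 / tau))


if __name__ == "__main__":
    eps0 = 10.0
    for tau in (0.10, 0.11, 0.12):
        p = model_vapor_pressure(tau, eps0)
        print(f"tau = {tau:.2f}   log p = {math.log(p):.3f}")
```

For $\tau \ll \hbar\omega$ the bracket is close to unity and $\log p$ falls nearly linearly in $1/\tau$ with slope close to $-\varepsilon_0$, the same exponential form as (20) with $\varepsilon_0$ playing the role of the latent heat per atom; the factor $\tau^{5/2}$ adds only a slow correction.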
VAN DER WAALS EQUATION OF STATE


The simplest model of a liquid-gas phase transition is that of van der Waals, who
modified the ideal gas equation $pV = N\tau$ to take into account approximately
the interactions between atoms or molecules. By the argument that we give
below, he was led to a modified equation of state of the form


$$(p + N^2a/V^2)(V - Nb) = N\tau, \tag{33}$$


known as the van der Waals equation of state. This is written for N atoms in
volume V. The a, b are interaction constants to be defined; the constant a is
a measure of the long range attractive part of the interaction between two
molecules, and the constant b is a measure of their short range repulsion
(Figure 10.7). We shall derive (33) with the help of the general relation
$p = -(\partial F/\partial V)_{\tau,N}$. We shall then treat the thermodynamic properties of the model
in order to exhibit the liquid-gas transition.
For an ideal gas we have, from (6.24),


$$F(\text{ideal gas}) = -N\tau[\log(n_Q/n) + 1]. \tag{34}$$


The hard core repulsion at short distances can be treated approximately as
if the gas had available not the volume V, but the free volume $V - Nb$, where
b is the volume per molecule. We therefore replace the concentration $n = N/V$
in (34) by $N/(V - Nb)$. Thus, instead of (34), we have


$$F = -N\tau\{\log[n_Q(V - Nb)/N] + 1\}. \tag{35}$$


To this we now add a correction for the intermolecular attractive forces. 


Figure 10.7 The interaction energy between
two molecules consists of a short range
repulsion plus a long range attraction. The
short range repulsion can be described
approximately by saying that each molecule
has a hard, impenetrable core.

[Vertical axis of the figure: potential energy.]
Mean Field Method


There exists a simple approximate method, called the mean field method, for
taking into account the effect of weak long range interactions among the
particles of a system. The most widely known applications of the method are to
gases and to ferromagnets. Let $\varphi(r)$ denote the potential energy of interaction
of two atoms separated by a distance r. When the concentration of atoms in
the gas is n, the average value of the total interaction of all other atoms on
the atom at r = 0 is


$$\int_b^\infty dV\,\varphi(r)\,n = n\int_b^\infty dV\,\varphi(r) = -2na, \tag{36}$$


where $-2a$ denotes the value of the integral $\int_b^\infty dV\,\varphi(r)$. The factor of two is a
useful convention. We exclude the hard core sphere of volume b from the 
volume of integration. In writing (36) we assume that the concentration n is 
constant throughout the volume accessible to the molecules of the gas. That is, 
we use the mean value of n. This assumption is the essence of the mean field 
approximation. By assuming uniform concentration we ignore the increase 
of concentration in regions of strong attractive potential energy. In modern 
language we say that the mean field method neglects correlations between 
interacting molecules. 

From (36) it follows that the interactions change the energy and the free
energy of a gas of N molecules in volume V by


$$\Delta F \simeq \Delta U = -\tfrac{1}{2}(2Nna) = -N^2a/V. \tag{37}$$


The factor $\tfrac{1}{2}$ is common to self-energy problems; it arranges that an interaction
“bond” between two molecules is counted only once in the total energy. The
exact number of bonds is $\tfrac{1}{2}N(N - 1)$, which we approximate as $\tfrac{1}{2}N^2$.

We add (37) to (35) to obtain the van der Waals approximation for the
Helmholtz free energy of a gas:


$$F(\text{vdW}) = -N\tau\{\log[n_Q(V - Nb)/N] + 1\} - N^2a/V. \tag{38}$$


The pressure is


$$p = -(\partial F/\partial V)_{\tau,N} = \frac{N\tau}{V - Nb} - \frac{N^2a}{V^2}, \tag{39}$$




Figure 10.8 Directions of intermolecular
forces that act on molecules near the boundary
of a volume V. The van der Waals argument
suggests that these forces contribute an internal
pressure $N^2a/V^2$ which is to be added to the
external pressure p, so that $p + N^2a/V^2$ should
be used as the pressure in the gas law.


Figure 10.9 The container of volume V has N 
molecules, each of volume b. The volume not 
occupied by molecules is V — Nb. Intuition 
suggests that this free volume should be used in 
the gas law in place of the container volume V. 


or 


$$(p + N^2a/V^2)(V - Nb) = N\tau, \tag{40}$$


the van der Waals equation of state. The terms in a and b are interpreted in 
Figures 10.8 and 10.9. 
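
The step from the free energy (38) to the pressure (39) can be checked numerically. This is a minimal sketch in arbitrary units: the function names, the sample parameter values, and the value of $n_Q$ (which drops out of the volume derivative) are illustrative assumptions, and the derivative is approximated by a central difference.

```python
import math


def free_energy_vdw(V, N, tau, a, b, n_q=1.0e6):
    """Helmholtz free energy (38); n_q is an arbitrary illustrative constant."""
    return -N * tau * (math.log(n_q * (V - N * b) / N) + 1.0) - N**2 * a / V


def pressure_vdw(V, N, tau, a, b):
    """Closed-form van der Waals pressure (39)."""
    return N * tau / (V - N * b) - N**2 * a / V**2


def pressure_from_free_energy(V, N, tau, a, b, h=1e-5):
    """p = -(dF/dV) at fixed tau and N, by a central finite difference."""
    dF = free_energy_vdw(V + h, N, tau, a, b) - free_energy_vdw(V - h, N, tau, a, b)
    return -dF / (2.0 * h)


if __name__ == "__main__":
    args = dict(V=1.0, N=100, tau=1.0, a=0.01, b=0.001)
    print(pressure_vdw(**args), pressure_from_free_energy(**args))
```

The two values agree to within the finite-difference error, and setting a = b = 0 recovers the ideal gas pressure $N\tau/V$.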


Critical Points for the van der Waals Gas 


We define the quantities 


$$p_c \equiv a/27b^2; \qquad V_c \equiv 3Nb; \qquad \tau_c \equiv 8a/27b. \tag{41}$$
Figure 10.10 The van der Waals equation of state near
the critical temperature. Courtesy of R. Cahn.


In terms of these quantities the van der Waals equation becomes


$$\left(\frac{p}{p_c} + \frac{3}{(V/V_c)^2}\right)\left(\frac{V}{V_c} - \frac{1}{3}\right) = \frac{8\tau}{3\tau_c}. \tag{42}$$

This equation is plotted in Figure 10.10 for several temperatures near the
temperature $\tau_c$. The equation may be written in terms of the dimensionless
variables


$$\hat p \equiv p/p_c; \qquad \hat V \equiv V/V_c; \qquad \hat\tau \equiv \tau/\tau_c, \tag{43}$$


as


$$\left(\hat p + \frac{3}{\hat V^2}\right)\left(\hat V - \frac{1}{3}\right) = \frac{8}{3}\hat\tau. \tag{44}$$

This result is known as the law of corresponding states. In terms of $\hat p$, $\hat V$, $\hat\tau$,
all gases look alike, if they obey the van der Waals equation. Values of a
and b are usually obtained by fitting to the observed $p_c$ and $\tau_c$. States of two
substances at the same $\hat p$, $\hat V$, $\hat\tau$ are called corresponding states of the substances.
Real gases do not obey the equation to high accuracy.
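
Fitting a and b to an observed critical point amounts to inverting (41): $b = \tau_c/8p_c$ and $a = 27\tau_c^2/64p_c$. The sketch below assumes SI units with $\tau_c = k_B T_c$; the function names are hypothetical, and the critical pressure used for nitrogen in the usage lines (3.39 MPa) is a value supplied here for illustration, not taken from the text.

```python
K_B = 1.380649e-23  # Boltzmann constant, J/K


def vdw_constants_from_critical_point(T_c, p_c):
    """Invert (41): return (a, b) per molecule from T_c in K and p_c in Pa."""
    tau_c = K_B * T_c
    b = tau_c / (8.0 * p_c)
    a = 27.0 * tau_c**2 / (64.0 * p_c)
    return a, b


def critical_point_from_constants(a, b):
    """Forward relations (41): return (T_c in K, p_c in Pa)."""
    tau_c = 8.0 * a / (27.0 * b)
    p_c = a / (27.0 * b**2)
    return tau_c / K_B, p_c


if __name__ == "__main__":
    a, b = vdw_constants_from_critical_point(126.0, 3.39e6)  # N2, assumed p_c
    print("a =", a, "J m^3   b =", b, "m^3")
```

The volume b obtained this way is the hard core volume per molecule in the model; as noted above, real gases follow the van der Waals equation only approximately, so constants fitted at the critical point should not be expected to reproduce the measured critical volume.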

At one point, the critical point, the curve of $\hat p$ versus $\hat V$ at constant $\hat\tau$ has a
horizontal point of inflection. Here the local maximum and minimum of the
p-V curve coincide, and there is no separation between the vapor and liquid
phases. At a horizontal point of inflection


$$\left(\frac{\partial\hat p}{\partial\hat V}\right)_{\hat\tau} = 0; \qquad \left(\frac{\partial^2\hat p}{\partial\hat V^2}\right)_{\hat\tau} = 0. \tag{45}$$


These conditions are satisfied by (44) if $\hat p = 1$; $\hat V = 1$; $\hat\tau = 1$. We call $p_c$, $V_c$,
and $\tau_c$ the critical pressure, critical volume, and critical temperature, respectively.
Above $\tau_c$ no phase separation exists.
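
The statement that (45) holds at $\hat p = \hat V = \hat\tau = 1$ can be verified directly. In this minimal sketch (44) is solved for $\hat p$ and differentiated by hand; the function names are illustrative.

```python
def p_hat(v, t):
    """Reduced pressure from (44): p = 8 t / (3 v - 1) - 3 / v^2."""
    return 8.0 * t / (3.0 * v - 1.0) - 3.0 / v**2


def dp_dv(v, t):
    """First derivative of p_hat with respect to v at constant t."""
    return -24.0 * t / (3.0 * v - 1.0) ** 2 + 6.0 / v**3


def d2p_dv2(v, t):
    """Second derivative of p_hat with respect to v at constant t."""
    return 144.0 * t / (3.0 * v - 1.0) ** 3 - 18.0 / v**4


if __name__ == "__main__":
    print(p_hat(1.0, 1.0), dp_dv(1.0, 1.0), d2p_dv2(1.0, 1.0))
```

At $\hat V = \hat\tau = 1$ the pressure is 1 and both derivatives vanish. For $\hat\tau > 1$ the first derivative is negative at every volume, so the isotherm has no local maximum or minimum, consistent with the absence of phase separation above $\tau_c$.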


Gibbs Free Energy of the van der Waals Gas 


The Gibbs free energy of the van der Waals gas exhibits the characteristics of
the liquid-gas phase transition at constant pressure. With $G = F + pV$, we
have from (38) and (39) the result


$$G(\tau,V,N) = \frac{N\tau V}{V - Nb} - \frac{2N^2a}{V} - N\tau\{\log[n_Q(V - Nb)/N] + 1\}. \tag{46}$$


This equation gives G as a function of V, $\tau$, N; the natural variables for G are
p, $\tau$, N. Unfortunately we cannot conveniently put G into an analytic form as a
function of pressure instead of volume. We want $G(\tau,p,N)$ because we can then
obtain $\mu(\tau,p)$ as $G(\tau,p,N)/N$ by (9.13). It is $\mu$ that determines the phase coexistence
relation $\mu_l = \mu_g$. The results of numerical calculations of G versus p are plotted
in Figure 10.11 for temperatures below and at the critical temperature. At any
temperature the lowest branch represents the stable phase; the other branches
represent unstable phases. The pressure at which the branches cross determines
the transition between gas and liquid; this pressure is called the equilibrium
vapor pressure. Results for G versus $\tau$ are plotted in Figure 10.12.

Figure 10.13 shows, on a p-V diagram, the region $V < V_1$ in which only the
liquid phase exists and the region $V > V_2$ in which only the gas phase exists.
The phases coexist between $V_1$ and $V_2$. The value of $V_1$ or $V_2$ is determined by
the condition that $\mu_l(\tau,p) = \mu_g(\tau,p)$ along the horizontal line between $V_1$ and
$V_2$. This will occur if the shaded area below the line is equal to the shaded area
Figure 10.11 (a) Gibbs free energy versus pressure for van
der Waals equation of state: $\tau = 0.95\tau_c$. Courtesy of R. Cahn.
(b) Gibbs free energy versus pressure for van der Waals equation of
state: $\tau = \tau_c$.

above the line. To see this, consider


$$dG = -\sigma\,d\tau + V\,dp + \mu\,dN. \tag{47}$$


We have $dG = V\,dp$ at constant $\tau$ and constant total number of particles. The
difference of G between $V_1$ and $V_2$ is


$$G_g - G_l = \int V\,dp, \tag{48}$$


[Figure 10.12: vertical axis $G/N\tau_c$.]

Figure 10.12a Gibbs free energy versus temperature for van der Waals equation
of state at $p = 0.95\,p_c$. Courtesy of A. Manoliu.

Figure 10.12b Gibbs free energy versus temperature for van der Waals equation
of state at the critical pressure $p_c$.
Figure 10.13 Isotherm of van der Waals gas at a
temperature below the critical temperature. For volumes
less than $V_1$ only the liquid phase exists; for volumes
above $V_2$ only the gas phase exists. Between $V_1$ and $V_2$
the system in stable equilibrium lies along the coexistence
line and is an inhomogeneous mixture of two phases. The
liquid and gas phases coexist. The proportion of the
liquid and gas phases must be such that the sum of their
volumes equals the volume V that is available.


but the integral is just the sum of the shaded areas, one negative and one positive.
When the magnitudes of the areas are equal, $G_g(\tau,p) = G_l(\tau,p)$ and $\mu_g(\tau,p) =
\mu_l(\tau,p)$ along the horizontal coexistence line drawn in the figure. In equilibrium
we require $\mu_g = \mu_l$.
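
The equal-area condition can be solved numerically for the reduced van der Waals isotherm (44). The following is a sketch under stated assumptions, not the method used for the figures: the function names, the plain bisection root finder, the bracketing intervals, and the restriction to reduced temperatures between 0.3 and 1 are all choices made here for illustration.

```python
import math


def p_reduced(v, t):
    """Reduced van der Waals pressure from (44)."""
    return 8.0 * t / (3.0 * v - 1.0) - 3.0 / v**2


def integral_p(v, t):
    """Antiderivative of p_reduced with respect to v at constant t."""
    return (8.0 * t / 3.0) * math.log(3.0 * v - 1.0) + 3.0 / v


def bisect(f, lo, hi, steps=200):
    """Bisection; assumes f changes sign between lo and hi."""
    f_lo = f(lo)
    for _ in range(steps):
        mid = 0.5 * (lo + hi)
        f_mid = f(mid)
        if (f_mid > 0.0) == (f_lo > 0.0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def coexistence(t):
    """Return (p, v1, v2): equal-area pressure and the liquid and gas volumes."""
    if not 0.3 <= t < 1.0:
        raise ValueError("sketch is intended for 0.3 <= t < 1")

    def spinodal(v):
        # Zero where dp/dv = 0, i.e. 4 t v^3 = (3 v - 1)^2.
        return 4.0 * t * v**3 - (3.0 * v - 1.0) ** 2

    v_min = bisect(spinodal, 1.0 / 3.0 + 1e-9, 1.0)   # local minimum of p
    v_max = bisect(spinodal, 1.0, 100.0)              # local maximum of p
    p_lo = max(p_reduced(v_min, t), 1e-9)
    p_hi = p_reduced(v_max, t)

    def volumes(p):
        # Outer intersections of the isotherm with the horizontal line at p.
        v1 = bisect(lambda v: p_reduced(v, t) - p, 1.0 / 3.0 + 1e-9, v_min)
        v2 = bisect(lambda v: p_reduced(v, t) - p, v_max, 8.0 * t / (3.0 * p) + 1.0)
        return v1, v2

    def area_mismatch(p):
        # Integral of (p_reduced - p) dv from v1 to v2; zero for equal areas.
        v1, v2 = volumes(p)
        return integral_p(v2, t) - integral_p(v1, t) - p * (v2 - v1)

    p = bisect(area_mismatch, p_lo, p_hi)
    v1, v2 = volumes(p)
    return p, v1, v2


if __name__ == "__main__":
    print(coexistence(0.95))
```

The integral of $p\,dV$ along the isotherm minus the rectangle under the horizontal line is the signed sum of the two shaded areas, so its zero is the equal-area pressure; by (48) this is the pressure at which $G_g = G_l$. The temperature 0.95 in the usage line matches the isotherm of Figure 10.11a.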


Nucleation. Let $\Delta\mu = \mu_g - \mu_l$ be the chemical potential difference between
the vapor surrounding a small liquid droplet and the liquid in bulk (an infinitely
large drop). If $\Delta\mu$ is positive, the bulk liquid will have a lower free energy than
the gas and thus the liquid will be more stable than the gas. However, the
surface free energy of a liquid drop is positive and tends to increase the free
energy of the liquid. At small drop radii the surface can be dominant and the
drop can be unstable with respect to the gas. We calculate the change in Gibbs
free energy when a drop of radius R forms. If $n_l$ is the concentration of molecules
in the liquid,


$$\Delta G = G_l - G_g = -(4\pi/3)R^3 n_l\,\Delta\mu + 4\pi R^2\gamma, \tag{49}$$




where $\gamma$ is the surface free energy per unit area, or surface tension. The liquid
drop will grow when $G_l < G_g$. An unstable maximum of $\Delta G$ is attained when


$$d\Delta G/dR = 0 = -4\pi R^2 n_l\,\Delta\mu + 8\pi R\gamma, \tag{50}$$

or

$$R_c = 2\gamma/n_l\,\Delta\mu. \tag{51}$$


This is the critical radius for nucleation of a drop. At smaller R the drop will 
tend to evaporate spontaneously because that will lower the free energy. At 
larger R the drop will tend to grow spontaneously because that, too, will lower 
the free energy. 

The free energy barrier (Figure 10.14) that must be overcome by a thermal
fluctuation in order for a nucleus to grow beyond $R_c$ is found by substitution of
(51) in (49):


$$(\Delta G)_c = \frac{16\pi}{3} \frac{\gamma^3}{n_l^2 (\Delta\mu)^2} . \tag{52}$$


If we assume that the vapor behaves like an ideal gas, we can use Chapter 5
to express $\Delta\mu$ as


$$\Delta\mu = \tau \log(p/p_{eq}) ,$$


where $p$ is the vapor pressure in the gas phase and $p_{eq}$ the equilibrium vapor
pressure of the bulk liquid ($R \to \infty$). We use $\gamma = 72$ erg cm$^{-2}$ to estimate $R_c$
for water at 300 K and $p = 1.1\,p_{eq}$ to be $1 \times 10^{-6}$ cm.
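
As a rough numerical check of this estimate, the following Python sketch evaluates (49), (51), and (52) in CGS units. The liquid concentration $n_l$ is not quoted in the text; it is assumed here from a water density of 1 g cm$^{-3}$ and a molar mass of 18 g mol$^{-1}$. The function and variable names are illustrative only.

```python
import math

K_B = 1.380649e-16        # Boltzmann constant, erg/K
AVOGADRO = 6.02214076e23  # molecules per mole

def delta_g(R, gamma, n_l, delta_mu):
    """Excess free energy of a drop of radius R, Eq. (49)."""
    return -(4.0 * math.pi / 3.0) * R**3 * n_l * delta_mu + 4.0 * math.pi * R**2 * gamma

def critical_radius(gamma, n_l, delta_mu):
    """Critical radius R_c = 2*gamma/(n_l*delta_mu), Eq. (51)."""
    return 2.0 * gamma / (n_l * delta_mu)

def barrier(gamma, n_l, delta_mu):
    """Barrier height (16*pi/3)*gamma**3/(n_l*delta_mu)**2, Eq. (52)."""
    return (16.0 * math.pi / 3.0) * gamma**3 / (n_l * delta_mu) ** 2

GAMMA = 72.0                        # erg/cm^2, value used in the text
n_l = (1.0 / 18.0) * AVOGADRO       # molecules/cm^3 (assumed density and molar mass)
tau = K_B * 300.0                   # temperature in energy units, erg
delta_mu = tau * math.log(1.1)      # ideal-gas expression for p = 1.1 p_eq

R_c = critical_radius(GAMMA, n_l, delta_mu)   # in cm
dG_c = barrier(GAMMA, n_l, delta_mu)          # in erg
print(f"R_c = {R_c:.2e} cm, barrier = {dG_c / tau:.0f} tau")
```

Under these assumptions the radius comes out at the order of $10^{-6}$ cm quoted above, and the barrier is large compared with $\tau$, which is why a modest supersaturation can persist.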


Ferromagnetism 


A ferromagnet has a spontaneous magnetic moment, which means a magnetic 
moment even in zero applied magnetic field. We develop the mean field approxi- 
mation to the temperature dependence of the magnetization, defined as the 
magnetic moment per unit volume. The central assumption is that each magnetic 
atom experiences an effective field $B_E$ proportional to the magnetization:


$$B_E = \lambda M , \tag{53}$$


where $\lambda$ is a constant. We take the external applied field as zero.
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Figure 10.14 Excess free energy of drop relative to gas, as 
function of drop radius R, both in reduced units. The gas is 
supersaturated because the liquid has the lower free energy for 
this curve as drawn, but the surface energy of small drops creates 
an energy barrier that inhibits the growth of nuclei of the liquid 
phase. Thermal fluctuations eventually may carry nuclei over the 
barrier. 


Consider a system with a concentration $n$ of magnetic atoms, each of spin 1/2
and of magnetic moment $\mu$. In Chapter 3 we found an exact result for the
magnetization in a field $B$:

$$M = n\mu \tanh(\mu B/\tau) . \tag{54}$$


In the mean field approximation (53) this becomes, for a ferromagnet,


$$M = n\mu \tanh(\mu\lambda M/\tau) , \tag{55}$$

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Figure 10.15 Graphical solution of Eq. (56) for the 
reduced magnetization $m$ as a function of temperature.
The reduced magnetization is defined as $m = M/n\mu$. The
left-hand side of Eq. (56) is plotted as a straight line $m$
with unit slope. The right-hand side is $\tanh(m/t)$ and is
plotted versus $m$ for three different values of the reduced
temperature $t = \tau/n\mu^2\lambda = \tau/\tau_c$. The three curves
correspond to the temperatures $2\tau_c$, $\tau_c$, and $0.5\tau_c$. The
curve for $t = 2$ intersects the straight line $m$ only at
$m = 0$, as appropriate for the paramagnetic region (there
is no external applied magnetic field). The curve for $t = 1$
(or $\tau = \tau_c$) is tangent to the straight line $m$ at the origin;
this temperature marks the onset of ferromagnetism. The
curve for $t = 0.5$ is in the ferromagnetic region and
intersects the straight line $m$ at about $M = 0.94\,n\mu$. As
$t \to 0$ the intercept moves up to $m = 1$, so that all
magnetic moments are lined up at absolute zero.


a transcendental equation for $M$. We shall see that solutions of this equation
with nonzero $M$ exist in the temperature range between 0 and $\tau_c$. To solve (55)
we write it in terms of the reduced magnetization $m = M/n\mu$ and the reduced
temperature $t = \tau/n\mu^2\lambda$, whence


m = tanh(m/t). (56) 


We plot the right and left sides of this equation separately as functions of $m$,
as in Figure 10.15. The intercept of the two curves gives the value of $m$ at the
temperature of interest. The critical temperature is $t = 1$, or $\tau_c = n\mu^2\lambda$. The
curves of $M$ versus $\tau$ obtained in this way reproduce roughly the features of the
experimental results, as shown in Figure 10.16 for nickel. As $\tau$ increases the
magnetization decreases smoothly to zero at $\tau = \tau_c$, called the Curie temperature.

Figure 10.16 Saturation magnetization of nickel as a function of temperature,
together with the theoretical curve for spin 1/2 on the mean field theory. The
vertical axis is $M_s(T)/M_s(0)$.
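
The graphical solution of Eq. (56) can also be carried out numerically. The sketch below is one possible approach, not part of the original treatment: it finds the nonzero intersection by bisection, using the fact that $\tanh(m/t) - m$ is positive below the root and negative above it when $t < 1$. The function name and the tolerance are assumptions made for illustration.

```python
import math

def reduced_magnetization(t, tol=1e-12):
    """Nonnegative solution m of m = tanh(m/t), Eq. (56), for reduced temperature t > 0."""
    if t <= 0.0:
        raise ValueError("reduced temperature must be positive")
    if t >= 1.0:
        return 0.0            # paramagnetic region: only m = 0 intersects
    lo, hi = 0.0, 1.0         # the nonzero root lies strictly between 0 and 1
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if math.tanh(mid / t) > mid:
            lo = mid          # still below the intersection
        else:
            hi = mid          # at or above the intersection
    return 0.5 * (lo + hi)

# Magnetization curve m(t) at a few reduced temperatures
curve = [(t, reduced_magnetization(t)) for t in (0.1, 0.25, 0.5, 0.75, 0.9, 1.0, 2.0)]
for t, m in curve:
    print(f"t = {t:4.2f}   m = {m:.4f}")
```

The resulting values fall from $m \approx 1$ at low $t$ to $m = 0$ at $t = 1$, which is the shape of the theoretical curve in Figure 10.16.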


LANDAU THEORY OF PHASE TRANSITIONS 


Landau gave a systematic formulation of the mean field theory of phase transitions
applicable to a large variety of systems exhibiting such transitions. We
consider systems at constant volume and temperature, so that their Helmholtz
free energy $F = U - \tau\sigma$ is a minimum in equilibrium. The big question is, a
minimum with respect to what variables? It is not helpful to consider all possible
variables. We suppose here that the system can be described by a single order
parameter $\xi$, the Greek xi, which might be the magnetization in a ferromagnetic
system, the dielectric polarization in a ferroelectric system, the fraction of
superconducting electrons in a superconductor, or the fraction of neighbor A-B
bonds to total bonds in an alloy AB. In thermal equilibrium the order parameter
will have a certain value $\xi = \xi_0(\tau)$. In the Landau theory we imagine that $\xi$
can be independently specified, and we consider the Landau free energy function


$$F_L(\xi,\tau) \equiv U(\xi,\tau) - \tau\sigma(\xi,\tau) , \tag{57}$$


where the energy and entropy are taken when the order parameter has the
specified value $\xi$, not necessarily $\xi_0$. The equilibrium value $\xi_0(\tau)$ is the value of
$\xi$ that makes $F_L$ a minimum, at a given $\tau$, and the actual Helmholtz free energy
$F(\tau)$ of the system at $\tau$ is equal to that minimum:


$$F(\tau) = F_L(\xi_0,\tau) \le F_L(\xi,\tau) \quad \text{if } \xi \ne \xi_0 . \tag{58}$$


Plotted as a function of $\xi$ for constant $\tau$, the Landau free energy may have more
than one minimum. The lowest of these determines the equilibrium state. In a
first order phase transition another minimum becomes the lowest minimum as
$\tau$ is increased.

We restrict ourselves to systems for which the Landau function is an even
function of $\xi$ in the absence of applied fields. Most ferromagnetic and
ferroelectric systems are examples of this. We also assume that $F_L(\xi,\tau)$ is a sufficiently
well-behaved function of $\xi$ that it can be expanded in a power series in $\xi$,
something that should not be taken for granted. For an even function of $\xi$, as
assumed,


$$F_L(\xi,\tau) = g_0(\tau) + \tfrac{1}{2} g_2(\tau)\xi^2 + \tfrac{1}{4} g_4(\tau)\xi^4 + \tfrac{1}{6} g_6(\tau)\xi^6 + \cdots . \tag{59}$$


The entire temperature dependence of $F_L(\xi,\tau)$ is contained in the expansion
coefficients $g_0$, $g_2$, $g_4$, $g_6$. These coefficients are matters for experiment or theory.

The simplest example of a phase transition occurs when $g_2(\tau)$ changes sign at
a temperature $\tau_0$, with $g_4$ positive and the higher terms negligible. For simplicity
we take $g_2(\tau)$ linear in $\tau$:


$$g_2(\tau) = (\tau - \tau_0)\alpha , \tag{60}$$

over the temperature range of interest, and we take $g_4$ as constant in that range.
With these idealizations,

$$F_L(\xi,\tau) = g_0(\tau) + \tfrac{1}{2}\alpha(\tau - \tau_0)\xi^2 + \tfrac{1}{4} g_4 \xi^4 . \tag{61}$$


The form (60) cannot be accurate over a very wide temperature range, and it 
certainly fails at low temperatures because such a linear dependence on tem- 
perature is not consistent with the third law. 

The equilibrium value of $\xi$ is found at the minimum of $F_L(\xi;\tau)$ with respect
to $\xi$:


$$(\partial F_L/\partial\xi)_\tau = (\tau - \tau_0)\alpha\xi + g_4\xi^3 = 0 , \tag{62}$$

which has the roots


$$\xi = 0 \quad \text{and} \quad \xi^2 = (\tau_0 - \tau)(\alpha/g_4) . \tag{63}$$

With $\alpha$ and $g_4$ positive, the root $\xi = 0$ corresponds to the minimum of the free
energy function (61) at temperatures above $\tau_0$; here the Helmholtz free energy is


$$F(\tau) = g_0(\tau) . \tag{64}$$


The other root, $\xi^2 = (\alpha/g_4)(\tau_0 - \tau)$, corresponds to the minimum of the free
energy function at temperatures below $\tau_0$; here the Helmholtz free energy is


$$F(\tau) = g_0(\tau) - (\alpha^2/4g_4)(\tau - \tau_0)^2 . \tag{65}$$


The variation of $F(\tau)$ with temperature is shown in Figure 10.17. The variation
of $F_L(\xi;\tau)$ as a function of $\xi^2$ for three representative temperatures is shown in
Figure 10.18, and the temperature dependence of the equilibrium value of $\xi$ is
shown in Figure 10.19.

Figure 10.17 Temperature dependence of the free energy
for an idealized phase transition of the second order.

Figure 10.18 Landau free energy function versus $\xi^2$ at
representative temperatures. As the temperature drops below $\tau_0$
the equilibrium value of $\xi$ gradually increases, as defined by the
position of the minimum of the free energy.

Figure 10.19 Spontaneous polarization versus
temperature, for a second-order phase transition. The
curve is not realistic at low temperatures because of the
use of Eq. (60): the third law of thermodynamics requires
that $d\xi/d\tau \to 0$ as $\tau \to 0$.


Our model describes a phase transition in which the value of the order parameter
goes continuously to zero as the temperature is increased to $\tau_0$. The entropy
$-\partial F/\partial\tau$ is continuous at $\tau = \tau_0$, so that there is no latent heat at the transition
temperature $\tau_0$. Such a transition is by definition a second order transition.
Transitions with a nonzero latent heat are called first order transitions; we
discuss them presently. The real world contains a remarkable diversity of
second order transitions; the best examples are ferromagnets and
superconductors.
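
The roots (63) and the free energy (65) can be checked numerically. The Python sketch below is illustrative only: it drops the additive term $g_0(\tau)$, which does not affect the location of the minimum, and the parameter values $\tau_0 = \alpha = g_4 = 1$ are arbitrary assumptions. It compares a brute-force grid minimum of (61) with the analytic root.

```python
import math

def landau_f(xi, tau, tau0=1.0, alpha=1.0, g4=1.0):
    """F_L - g_0 from Eq. (61)."""
    return 0.5 * alpha * (tau - tau0) * xi**2 + 0.25 * g4 * xi**4

def xi_analytic(tau, tau0=1.0, alpha=1.0, g4=1.0):
    """Nonnegative equilibrium root from Eq. (63); zero above tau0."""
    return math.sqrt(max(0.0, alpha * (tau0 - tau) / g4))

def xi_numeric(tau, xi_max=2.0, steps=20000, **params):
    """Grid point in [0, xi_max] with the lowest Landau free energy."""
    grid = (xi_max * i / steps for i in range(steps + 1))
    return min(grid, key=lambda xi: landau_f(xi, tau, **params))

def f_equilibrium(tau, tau0=1.0, alpha=1.0, g4=1.0):
    """F - g_0 from Eqs. (64) and (65)."""
    if tau >= tau0:
        return 0.0
    return -(alpha**2 / (4.0 * g4)) * (tau - tau0) ** 2

for tau in (0.5, 0.9, 1.2):
    print(tau, xi_numeric(tau), xi_analytic(tau), f_equilibrium(tau))
```

The order parameter found on the grid follows $\xi \propto (\tau_0 - \tau)^{1/2}$ below $\tau_0$ and vanishes above it, with no jump at $\tau_0$.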


Example: Ferromagnets. In the mean field approximation, ferromagnets satisfy the
Landau theory. To show this, consider an atom of magnetic moment $\mu$ in a magnetic field $B$,
which we shall set equal to the mean field $\lambda M$ as in (53). The interaction energy density is

$$U(M) = -\tfrac{1}{2} M \cdot B = -\tfrac{1}{2}\lambda M^2 , \tag{66}$$


where the factor $\tfrac{1}{2}$ is common to self-energy problems. The entropy density is given
approximately by Problem 2.2 as


$$\sigma(M) = \text{constant} - M^2/2n\mu^2 , \tag{67}$$


in the regime in which $M \ll n\mu$. Thus the free energy function per unit volume is


$$F_L(M) = \text{constant} - \tfrac{1}{2} M^2 \left(\lambda - \frac{\tau}{n\mu^2}\right) + \text{terms of higher order} . \tag{68}$$


At the transition temperature the coefficient of $M^2$ vanishes, so that

$$\tau_0 = n\mu^2\lambda , \tag{69}$$


in agreement with the discussion following (56).


First Order Transitions 


A latent heat characterizes a first order phase transition. The liquid-gas
transition at constant pressure is a first order transition. In the physics of solids first
order transitions are common in ferroelectric crystals and in phase
transformations in metals and alloys. The Landau function describes a first order transition
when the expansion coefficient $g_4$ is negative and $g_6$ is positive. We consider


$$F_L(\xi,\tau) = g_0(\tau) + \tfrac{1}{2}\alpha(\tau - \tau_0)\xi^2 - \tfrac{1}{4}|g_4(\tau)|\xi^4 + \tfrac{1}{6} g_6 \xi^6 + \cdots . \tag{70}$$


Figure 10.20 Landau free energy function versus $\xi^2$ in a
first order transition, at representative temperatures. At $\tau_c$
the Landau function has equal minima at $\xi = 0$ and at a
finite $\xi$ as shown. For $\tau$ below $\tau_c$ the absolute minimum is at
larger values of $\xi$; as $\tau$ passes through $\tau_c$ there is a
discontinuous change in the position of the absolute
minimum. The arrows mark the minima.


The extrema of this function are given by the roots of $\partial F_L/\partial\xi = 0$ as in
Figure 10.20:

$$\alpha(\tau - \tau_0)\xi - |g_4(\tau)|\xi^3 + g_6\xi^5 = 0 . \tag{71}$$

Either $\xi = 0$ or

$$\alpha(\tau - \tau_0) - |g_4(\tau)|\xi^2 + g_6\xi^4 = 0 . \tag{72}$$


Figure 10.21 Dependence of $\xi$ on $\tau - \tau_c$ for a
typical first order phase transition.


At the transition temperature $\tau_c$ the free energies will be equal for the phases
with $\xi = 0$ and with the root $\xi \ne 0$. The value of $\tau_c$ will not be equal to $\tau_0$,
and the order parameter $\xi$ (Figure 10.21) does not go continuously to zero at
$\tau_c$. These results differ from those in the second order phase transition treated
earlier, where $\xi$ went to zero continuously at $\tau_0 = \tau_c$. A first order
transformation may show hysteresis, as in supercooling or supersaturation, but no
hysteresis exists in a second order transition.
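
The transition temperature $\tau_c$ of the model (70) can be located numerically by asking where the free energy at the nonzero minimum of (72) equals the free energy at $\xi = 0$. The following Python sketch does this by bisection. It treats $|g_4|$ and $g_6$ as constants, drops $g_0(\tau)$, and uses the arbitrary values $\tau_0 = \alpha = |g_4| = g_6 = 1$; all of these are assumptions for illustration. Requiring $F_L = g_0$ together with (72) gives the closed form $\tau_c = \tau_0 + 3g_4^2/(16\alpha g_6)$ with $\xi^2 = 3|g_4|/4g_6$ at the transition, which the code reproduces.

```python
import math

def landau_f(xi2, tau, tau0=1.0, alpha=1.0, g4_abs=1.0, g6=1.0):
    """F_L - g_0 from Eq. (70), written in terms of xi2 = xi**2."""
    a = alpha * (tau - tau0)
    return 0.5 * a * xi2 - 0.25 * g4_abs * xi2**2 + g6 * xi2**3 / 6.0

def ordered_minimum(tau, tau0=1.0, alpha=1.0, g4_abs=1.0, g6=1.0):
    """Larger root xi**2 of Eq. (72); this root is the nonzero minimum of F_L."""
    a = alpha * (tau - tau0)
    disc = max(0.0, g4_abs**2 - 4.0 * g6 * a)
    return (g4_abs + math.sqrt(disc)) / (2.0 * g6)

def transition_temperature(tau0=1.0, alpha=1.0, g4_abs=1.0, g6=1.0, tol=1e-12):
    """Bisect for tau_c, where F_L at the ordered minimum equals F_L at xi = 0."""
    lo = tau0                                     # ordered minimum is lower than xi = 0 here
    hi = tau0 + g4_abs**2 / (4.0 * alpha * g6)    # ordered minimum disappears above this
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        xi2 = ordered_minimum(mid, tau0, alpha, g4_abs, g6)
        if landau_f(xi2, mid, tau0, alpha, g4_abs, g6) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

tau_c = transition_temperature()
xi2_c = ordered_minimum(tau_c)
print(tau_c, xi2_c)
```

The nonzero value of $\xi^2$ at $\tau_c$ is the discontinuity sketched in Figure 10.21.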


SUMMARY 


1. The coexistence curve in the $p$-$\tau$ plane between two phases must satisfy the
Clausius-Clapeyron equation:


$$\frac{dp}{d\tau} = \frac{L}{\tau\,\Delta v} ,$$


where L is the latent heat and Av is the volume difference per atom between 
the two phases. 


2. The latent heat $L = H_g - H_l$, where $H = U + pV$ is the enthalpy.


3. The van der Waals equation of state is 
$$(p + N^2 a/V^2)(V - Nb) = N\tau .$$

4. In the Landau free energy function

$$F_L(\xi,\tau) \equiv U(\xi,\tau) - \tau\sigma(\xi,\tau) ,$$

the energy and entropy are taken when the order parameter has the specified
value $\xi$, not necessarily the thermal equilibrium value $\xi_0$. The function $F_L$
is a minimum with respect to $\xi$ when the system is in thermal equilibrium.


5. A first order phase transition is characterized by a latent heat and by 
hysteresis. 


PROBLEMS 


1. Entropy, energy, and enthalpy of van der Waals gas. (a) Show that the
entropy of the van der Waals gas is


$$\sigma = N\{\log[n_Q(V - Nb)/N] + \tfrac{5}{2}\} . \tag{73}$$


(b) Show that the energy is


$$U = \tfrac{3}{2}N\tau - N^2 a/V . \tag{74}$$

(c) Show that the enthalpy $H = U + pV$ is

$$H(\tau,V) = \tfrac{5}{2}N\tau + N^2 b\tau/V - 2N^2 a/V ; \tag{75}$$

$$H(\tau,p) = \tfrac{5}{2}N\tau + Nbp - 2Nap/\tau . \tag{76}$$


All results are given to first order in the van der Waals correction terms $a$, $b$.


2. Calculation of dT/dp for water. Calculate from the vapor pressure
equation the value of $dT/dp$ near $p = 1$ atm for the liquid-vapor equilibrium of
water. The heat of vaporization at 100°C is 2260 J g$^{-1}$. Express the result in
kelvin/atm.


3. Heat of vaporization of ice. The pressure of water vapor over ice is 3.88 mm
Hg at −2°C and 4.58 mm Hg at 0°C. Estimate in J mol$^{-1}$ the heat of
vaporization of ice at −1°C.


4. Gas-solid equilibrium. Consider a version of the example (26)-(32) in which
we let the oscillators in the solid move in three dimensions. (a) Show that in the
high temperature regime ($\tau \gg \hbar\omega$) the vapor pressure is


$$p = \left(\frac{M}{2\pi}\right)^{3/2} \frac{\omega^3}{\tau^{1/2}} \exp(-\varepsilon_0/\tau) . \tag{77}$$


(b) Explain why the latent heat per atom is $\varepsilon_0 - \tfrac{1}{2}\tau$.


5. Gas-solid equilibrium. Consider the gas-solid equilibrium under the
extreme assumption that the entropy of the solid may be neglected over the
temperature range of interest. Let $-\varepsilon_0$ be the cohesive energy of the solid, per atom.
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Treat the gas as ideal and monatomic. Make the approximation that the volume 
accessible to the gas is the volume V of the container, independent of the much 
smaller volume occupied by the solid. (a) Show that the total Helmholtz free 
energy of the system is 


$$F = F_s + F_g = -N_s\varepsilon_0 + N_g\tau[\log(N_g/V n_Q) - 1] , \tag{78}$$


where the total number of atoms, $N = N_s + N_g$, is constant. (b) Find the
minimum of the free energy with respect to $N_g$; show that in the equilibrium condition


$$N_g = n_Q V \exp(-\varepsilon_0/\tau) . \tag{79}$$

(c) Find the equilibrium vapor pressure.
6. Thermodynamics of the superconducting transition. (a) Show that 


$$\sigma_S - \sigma_N = \frac{1}{2\mu_0}\frac{d(B_c^2)}{d\tau} = \frac{B_c}{\mu_0}\frac{dB_c}{d\tau} \tag{80}$$


in SI units for $B_c$. Because $B_c$ decreases with increasing temperature, the right
side is negative. The superconducting phase has the lower entropy: it is the more
ordered phase. As $\tau \to 0$, the entropy in both phases will go to zero, consistent
with the third law. What does this imply for the shape of the curve of $B_c$ versus $\tau$?
(b) At $\tau = \tau_c$, we have $B_c = 0$ and hence $\sigma_S = \sigma_N$. Show that this result has the
following consequences: (1) The two free energy curves do not cross at $\tau_c$ but
merge, as shown in Figure 10.22. (2) The two energies are the same: $U_S(\tau_c) =
U_N(\tau_c)$. (3) There is no latent heat associated with the transition at $\tau = \tau_c$.
What is the latent heat of the transition when carried out in a magnetic field,
at $\tau < \tau_c$? (c) Show that $C_S$ and $C_N$, the heat capacities per unit volume, are
related by


$$\Delta C \equiv C_S - C_N = \frac{\tau}{2\mu_0}\frac{d^2(B_c^2)}{d\tau^2} . \tag{81}$$


Figure 8.18 is a plot of $C/T$ vs $T^2$ and shows that $C_S$ decreases much faster
than linearly with decreasing $\tau$, while $C_N$ decreases as $\gamma\tau$. For $\tau \ll \tau_c$, $\Delta C$ is
dominated by $C_N$. Show that this implies


$$\gamma = -\frac{1}{2\mu_0}\left[\frac{d^2(B_c^2)}{d\tau^2}\right]_{\tau \to 0} . \tag{82}$$

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Figure 10.22 Experimental values of the free energy as a function of 
temperature for aluminum in the superconducting state and in the normal 
state. Below the transition temperature T, = 1.180 K the free energy is lower 
in the superconducting state. The two curves merge at the transition 
temperature, so that the phase transition is second order (there is no latent heat 
of transition at T,). The curve Fs is measured in zero magnetic field, and Fy is 
measured in a magnetic field sufficient to put the specimen in the normal state. 
Courtesy of N. E. Phillips. 


7. Simplified model of the superconducting transition. The $B_c(\tau)$ curves of most
superconductors have shapes close to simple parabolas. Suppose that


$$B_c(\tau) = B_{c0}[1 - (\tau/\tau_c)^2] . \tag{83}$$


Assume that $C_S$ vanishes faster than linearly as $\tau \to 0$. Assume also that $C_N$ is
linear in $\tau$, as for a Fermi gas (Chapter 7). Draw on the results of Problem 6 to
calculate and plot the $\tau$ dependences of the two entropies, the two heat capacities,
and the latent heat of the transition. Show that $C_S(\tau_c)/C_N(\tau_c) = 3$.


8. First order crystal transformation. Consider a crystal that can exist in either
of two structures, denoted by $\alpha$ and $\beta$. We suppose that the $\alpha$ structure is the
stable low temperature form and the $\beta$ structure is the stable high temperature
form of the substance. If the zero of the energy scale is taken as the state of
separated atoms at infinity, then the energy density $U(0)$ at $\tau = 0$ will be
negative. The phase stable at $\tau = 0$ will have the lower value of $U(0)$; thus
$U_\alpha(0) < U_\beta(0)$. If the velocity of sound $v_\beta$ in the $\beta$ phase is lower than $v_\alpha$ in the
$\alpha$ phase, corresponding to lower values of the elastic moduli for $\beta$, then the
thermal excitations in the $\beta$ phase will have larger amplitudes than in the $\alpha$
phase. The larger the thermal excitation, the larger the entropy and the lower
the free energy. Soft systems tend to be stable at high temperatures, hard
systems at low. (a) Show from Chapter 4 that the free energy density contributed
by the phonons in a solid at a temperature much less than the Debye temperature
is given by $-\pi^2\tau^4/30v^3\hbar^3$, in the Debye approximation with $v$ taken as the
velocity of all phonons. (b) Show that at the transformation temperature


$$\tau_c^4 = (30\hbar^3/\pi^2)[U_\beta(0) - U_\alpha(0)]/(v_\beta^{-3} - v_\alpha^{-3}) . \tag{84}$$

There will be a finite real solution if $v_\beta < v_\alpha$. This example is a simplified model
of a class of actual phase transformations in solids. (c) The latent heat of
transformation is defined as the thermal energy that must be supplied to carry the
system through the transformation. Show that the latent heat for this model is


$$L = 4[U_\beta(0) - U_\alpha(0)] . \tag{85}$$


In (84) and (85), $U$ refers to unit volume.
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Many applications of materials science, and large parts of chemistry and 
biophysics, are concerned with the properties of multicomponent systems that 
have two or more phases in coexistence. Beautiful, unexpected, and important 
physical effects occur in such systems. We treat the fundamentals of the subject 
in this chapter, with examples drawn from simple situations. 


SOLUBILITY GAPS 


Mixtures are systems of two or more different chemical species. Binary mixtures 
have only two constituents. Mixtures with three and four constituents are called 
ternary and quaternary mixtures. If the constituents are atoms, and not mole- 
cules, the mixture is called an alloy. 

A mixture is homogeneous when its constituents are intermixed on an atomic 
scale to form a single phase, as in a solution. A mixture is heterogeneous when it 
contains two or more distinct phases, such as oil and water. The everyday 
expression “oil and water do not mix” means that their mixture does not form 
a single homogeneous phase. 

The properties of mixtures differ from the properties of pure substances. The 
melting and solidification properties of mixtures are of special interest. Hetero- 
geneous mixtures may melt at lower temperatures than their constituents. 
Consider a gold-silicon alloy: pure Au melts at 1063°C and pure Si at 1404°C, 
but an alloy of 69 pct Au and 31 pct Si melts (and solidifies) at 370°C. This is not
the result of the formation of any low-melting Au-Si compound: microscopic 
investigation of the solidified mixture shows a two phase mixture of almost 
pure Au side by side with almost pure Si (Figure 11.1). Mixtures with such 
properties are common, and they are of practical importance precisely because 
of their lowered melting points. 

What determines whether two substances form a homogeneous or a hetero- 
geneous mixture? What is the composition of the phases that are in equilibrium 
with each other in a heterogeneous mixture? The properties of mixtures can be 
understood from the principle that any system at a fixed temperature will 
evolve to the configuration of minimum free energy. Two substances will 
dissolve in each other and form a homogeneous mixture if that is the configura- 
tion of lowest free energy accessible to the components. The substances will 




Figure 11.1 Heterogeneous gold-silicon alloy. When a mixture of 69 pct Au and 31 pct
Si is melted and then solidified, the mixture segregates into a phase of almost pure Au
(light phase) coexistent with a phase of almost pure Si (dark phase). Magnified about
800 times. The composition given is that of the lowest-melting Au-Si mixture, the
so-called eutectic mixture, a concept explained later in the text. Photograph courtesy 
of Stephan Justi. 


form a heterogeneous mixture if the combined free energy of the two separate 
phases side by side is lower than the free energy of the homogeneous mixture: 
then we say that the mixture exhibits a solubility gap. 

A heterogeneous mixture will melt at a lower temperature than the separate 
substances if the free energy of the homogeneous melt is lower than the com- 
bined free energies of the two separate solid phases. 

Throughout this chapter we assume for simplicity that the external pressure 
may be neglected, and we set pV = 0. Then volume changes do not involve 
work, and the appropriate free energy is the Helmholtz free energy F rather 
than the Gibbs free energy G. We will usually simply speak of the free energy. 

We discuss binary mixtures of constituents that do not form well-defined 
compounds with each other. Our principal interest is in binary alloys. Consider
a mixture of $N_A$ atoms of substance A and $N_B$ atoms of substance B. The total
number of atoms is


$$N = N_A + N_B . \tag{1}$$

We express the composition of the system in terms of the fraction $x$ of B atoms:

$$x = N_B/N ; \qquad 1 - x = N_A/N . \tag{2}$$


Suppose the system forms a homogeneous solution, with an average free energy
per atom given by


$$f = F/N . \tag{3}$$


Suppose further that $f(x)$ has the functional form shown in Figure 11.2. Because
this curve contains a range in which the second derivative $d^2f/dx^2$ is negative,
we can draw a line tangent to the curve at two points, at $x = x_\alpha$ and $x = x_\beta$.
Free energy curves of this shape are common, and we will see later what may
cause this shape. Any homogeneous mixture in the composition range


$$x_\alpha < x < x_\beta \tag{4}$$


is unstable with respect to two separate phases of composition $x_\alpha$ and $x_\beta$. We
shall show that the average free energy per atom of the segregated mixture is
given by the point $i$ on the straight line connecting the points $\alpha$ and $\beta$. Thus in the
entire composition range (4) the segregated system has a lower free energy than
the homogeneous system.

Proof: The free energy of a segregated mixture of the two phases $\alpha$ and $\beta$ is


$$F = N_\alpha f(x_\alpha) + N_\beta f(x_\beta) , \tag{5}$$


where $N_\alpha$ and $N_\beta$ are the total numbers of atoms in phases $\alpha$ and $\beta$, respectively.
These numbers satisfy the relations


$$N_\alpha + N_\beta = N ; \qquad x_\alpha N_\alpha + x_\beta N_\beta = xN , \tag{6}$$

which may be solved for $N_\alpha$ and $N_\beta$:


$$N_\alpha = \frac{x_\beta - x}{x_\beta - x_\alpha} N ; \qquad N_\beta = \frac{x - x_\alpha}{x_\beta - x_\alpha} N . \tag{7}$$


Figure 11.2 Free energy per atom as a function of composition, for a
system with a solubility gap. If the free energy per atom of a
homogeneous mixture has a shape such that a tangent can be drawn
that touches the curve at two different points $\alpha$ and $\beta$, the composition
range between the two points is unstable. Any mixture with a
composition in this range will decompose into two phases with the
composition $x_\alpha$ and $x_\beta$. The free energy of the two phase mixture is
given by the point $i$ on the straight line, below the point $h$.


From (5) we obtain


$$f = \frac{F}{N} = \frac{1}{x_\beta - x_\alpha}\left[(x_\beta - x) f(x_\alpha) + (x - x_\alpha) f(x_\beta)\right] \tag{8}$$


for the free energy of the two phase system. This result is linear in $x$ and is a
straight line in the $f$-$x$ plane. If we set $x = x_\alpha$ or $x_\beta$, we see that the line does go
through the points $\alpha$ and $\beta$. Thus $f$ in the interval between $x_\alpha$ and $x_\beta$ is given by
the point $i$ on the straight line connecting $\alpha$ and $\beta$.
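
The relations (7) and (8) amount to a lever rule. A minimal Python sketch is given below; the function names and the sample compositions and free energies are hypothetical values chosen only to illustrate the arithmetic.

```python
def lever_rule(x, x_alpha, x_beta):
    """Phase fractions (N_alpha/N, N_beta/N) from Eq. (7)."""
    if not x_alpha <= x <= x_beta:
        raise ValueError("x must lie between x_alpha and x_beta")
    width = x_beta - x_alpha
    return (x_beta - x) / width, (x - x_alpha) / width

def two_phase_free_energy(x, x_alpha, x_beta, f_alpha, f_beta):
    """Free energy per atom of the segregated mixture, Eq. (8)."""
    frac_alpha, frac_beta = lever_rule(x, x_alpha, x_beta)
    return frac_alpha * f_alpha + frac_beta * f_beta

# Hypothetical gap limits and free energies per atom (arbitrary units)
frac_a, frac_b = lever_rule(0.4, 0.2, 0.7)
f_mix = two_phase_free_energy(0.4, 0.2, 0.7, -1.0, -2.0)
print(frac_a, frac_b, f_mix)
```

The two fractions add to one and reproduce the overall composition $x$, which is the content of (6).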
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We have not yet made use of the assumption that the straight line is tangent to 
$f(x)$ at the points $\alpha$ and $\beta$, and therefore our result holds for any straight line
that has two points $\alpha$ and $\beta$ in common with $f(x)$. But for a given value of $x$, the
lowest free energy is obtained by drawing the lowest possible straight line that
has two points in common with $f(x)$, on opposite sides of $x$. The lowest possible
straight line is the two-point tangent shown. The compositions $x_\alpha$ and $x_\beta$ are
the limits of the solubility gap of the system.

Once the system has reached its lowest free energy, the two phases must be in
diffusive equilibrium with respect to both atomic species, so that their chemical
potentials satisfy


$$\mu_{A\alpha} = \mu_{A\beta} ; \qquad \mu_{B\alpha} = \mu_{B\beta} . \tag{9}$$


We show in Problem 1 that $\mu_A$ and $\mu_B$ are given by the intercepts of the
two-point tangent with the two vertical edges of the $f(x)$ plot at $x = 0$ and $x = 1$,
as in Figure 11.2.


ENERGY AND ENTROPY OF MIXING 


The Helmholtz free energy $F = U - \tau\sigma$ has contributions from the energy and
from the entropy. We treat the effect of mixing two components A and B on
both terms. Let $u_A$ and $u_B$ be the energy per atom of the pure substances A and B,
referred to separated atoms at infinity. The average energy per atom of the
constituents is


$$u = (u_A N_A + u_B N_B)/N = u_A + (u_B - u_A)x , \tag{10}$$


which defines a straight line in the $u$-$x$ plane, Figure 11.3. The average energy
per atom of the homogeneous mixture may be larger or smaller than for the
separate constituents. In the example of Figure 11.3, the energy of the
homogeneous mixture is larger than the energy of the separate constituents. The
energy excess is called the energy of mixing.

If the $-\tau\sigma$ term in the free energy is negligible, as at $\tau = 0$, a positive mixing
energy means that a homogeneous mixture is not stable. Any such mixture will
then separate into two phases. But at a finite temperature the $-\tau\sigma$ term in the
free energy of the homogeneous mixture always tends to lower the free energy.

The entropy of a mixture contains a contribution, called the entropy of mixing,
that is not present in the entropies of the separate components. The mixing
entropy arises when atoms of the different species are interchanged in position;
this operation generates a different state of the system. Because of such
interchanges a mixture has more accessible states than the two separate substances,
and hence the mixture has the higher entropy.

Figure 11.3 Energy per atom as a function of composition
in a system with a positive mixing energy. A simple
example for which a solubility gap may occur is that of a
system in which the energy per atom of the homogeneous
mixture is greater than that of the separate phases, so that
$d^2u/dx^2 < 0$ for all compositions. The mixing energy is the
difference between the $u(x)$ curve and the straight line.

In (3.80) we calculated the mixing entropy $\sigma_M$ of a homogeneous alloy
A$_{1-x}$B$_x$, to find


$$\sigma_M = -N[(1 - x)\log(1 - x) + x\log x] , \tag{11}$$


as plotted in Figure 11.4. The curve of $\sigma_M$ versus $x$ has the important property
that the slope at the ends of the composition range is vertical. We have


$$\frac{1}{N}\frac{d\sigma_M}{dx} = \log(1 - x) - \log x = \log\frac{1 - x}{x} , \tag{12}$$


which goes to $+\infty$ as $x \to 0$ and to $-\infty$ as $x \to 1$.
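
The behavior of (11) and (12) is easy to check numerically. The short Python sketch below is an illustration with assumed function names: it evaluates the mixing entropy per atom and its slope, and shows the slope growing without bound (logarithmically) toward either end of the composition range.

```python
import math

def mixing_entropy_per_atom(x):
    """sigma_M / N from Eq. (11); the limits x -> 0 and x -> 1 are zero."""
    if x <= 0.0 or x >= 1.0:
        return 0.0
    return -((1.0 - x) * math.log(1.0 - x) + x * math.log(x))

def mixing_entropy_slope(x):
    """(1/N) d(sigma_M)/dx = log((1 - x)/x), Eq. (12), for 0 < x < 1."""
    return math.log((1.0 - x) / x)

for x in (1e-9, 0.1, 0.5, 0.9, 1.0 - 1e-9):
    print(x, mixing_entropy_per_atom(x), mixing_entropy_slope(x))
```

The entropy per atom is largest at $x = 1/2$, where it equals $\log 2$ and the slope vanishes.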
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Figure 11.4 Mixing entropy. In any mixture of two constituents an 
interchange of two atoms of different species leads to a new state of the 
system. The logarithm of the number of states related in this way is the 
mixing entropy. 


Consider now the quantity

$$f_0(x) = u(x) - (\sigma - \sigma_M)\tau/N , \tag{13}$$


which is the free energy per atom without the mixing entropy contribution. The
non-mixing part of the entropy, $\sigma - \sigma_M$, is usually nearly the same for the
mixture as for the separate components, so that $(\sigma - \sigma_M)\tau$ is nearly a linear
function of the composition $x$. If we assume this, the $f_0(x)$ curve has the same
shape as the $u(x)$ curve, but offset vertically.

Figure 11.5 Free energy per atom versus composition, at three
temperatures. The curve $f_0$ is the free energy per atom without the
mixing entropy contribution. For illustration a parabolic composition
dependence is assumed, and the temperature dependence of $f_0$ is neglected.
The three solid curves represent the free energy including the
mixing entropy, for the temperatures $0.8\,\tau_M$, $1.0\,\tau_M$, and $1.2\,\tau_M$,
where $\tau_M$ is the maximum temperature for which there is a solubility
gap. The phase separation at $0.8\,\tau_M$ is apparent.

If we add the mixing entropy contribution $-\tau\sigma_M/N$ to $f_0(x)$, we obtain at
various temperatures the $f(x)$ curves shown in Figure 11.5. In drawing the
figure we have ignored the temperature dependence of $f_0(x)$ itself, because for
our argument this is irrelevant. Three important deductions follow from the
construction of the $f(x)$ curves:


(a) At all finite temperatures $f(x)$ turns up at both ends of the composition
range, because of the infinite slope of the mixing entropy contribution.

(b) Below a certain temperature $\tau_M$ there is a composition range within which
the negative second derivative of the $f_0(x)$ curve is stronger than the
positive second derivative of the $-\tau\sigma_M$ contribution, thereby making it
possible to draw a common tangent to $f(x)$ at two different values of $x$.

(c) Above $\tau_M$ the curve has a positive second derivative at all compositions.

We conclude that the A-B system with positive mixing energy will exhibit a
solubility gap below the temperature $\tau_M$. The composition range of the gap
widens with decreasing temperature, but the gap can reach the edges of the
composition range only as $\tau \to 0$. At any finite temperature there is a finite
solubility of A in B and of B in A, a result obtained earlier in Chapter 3. The
new result is that the mutual solubility is limited only below $\tau_M$. Positive
mixing energies arise in different ways. We now discuss three examples.


Example: Binary alloy with nearest-neighbor interactions. Consider an alloy A$_{1-x}$B$_x$ in
which the attractive interaction between unlike atoms is weaker than the attractive
interaction between like atoms. For simplicity we speak of the interactions as bonds. There are
three different bonds: A-A, A-B, and B-B. Let $u_{AA}$, $u_{AB}$, and $u_{BB}$ be the potential energies of
each bond. These binding energies will usually be negative with respect to separated atoms.

We assume the atoms are randomly distributed among the lattice sites. The average
energy of the bonds surrounding an A atom is


$$u_A = (1 - x)u_{AA} + x u_{AB} , \tag{14}$$


where $(1 - x)$ is the proportion of A and $x$ is the proportion of B. This result is written in the
mean field approximation of Chapter 10. Similarly, for B atoms,


$$u_B = (1 - x)u_{AB} + x u_{BB} . \tag{15}$$


The total energy is obtained by summing over both atom types. If each atom has $p$ nearest
neighbors, the average energy per atom is


$$u = \tfrac{1}{2}p[(1 - x)u_A + x u_B] = \tfrac{1}{2}p[(1 - x)^2 u_{AA} + 2x(1 - x)u_{AB} + x^2 u_{BB}] . \tag{16}$$


The factor $\tfrac{1}{2}$ arises because each bond is shared by the two atoms it connects. The result (16)
can be written as


$$u = \tfrac{1}{2}p[(1 - x)u_{AA} + x u_{BB}] + u_M . \tag{17}$$

Here

$$u_M = p\,x(1 - x)[u_{AB} - \tfrac{1}{2}(u_{AA} + u_{BB})] \tag{18}$$

is the mixing energy. On this model the mixing energy as a function of x is a parabola, as in 
Figure 11.5. 
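
The algebra connecting (16) to (17) and (18) can be checked numerically. The following is a minimal sketch, not part of the original derivation; the function names and the bond energies are hypothetical values chosen only for illustration.

```python
def energy_direct(x, p, u_aa, u_ab, u_bb):
    """Average bond energy per atom, written as in Eq. (16)."""
    return 0.5 * p * ((1 - x) ** 2 * u_aa + 2 * x * (1 - x) * u_ab + x ** 2 * u_bb)


def mixing_energy(x, p, u_aa, u_ab, u_bb):
    """Mixing energy per atom, Eq. (18): a parabola in x."""
    return p * x * (1 - x) * (u_ab - 0.5 * (u_aa + u_bb))


def energy_split(x, p, u_aa, u_ab, u_bb):
    """Linear interpolation between the pure constituents plus u_M, Eq. (17)."""
    linear_part = 0.5 * p * ((1 - x) * u_aa + x * u_bb)
    return linear_part + mixing_energy(x, p, u_aa, u_ab, u_bb)


# Hypothetical bond energies in arbitrary units; p = 12 neighbors as in an fcc lattice.
P, U_AA, U_AB, U_BB = 12, -1.0, -0.8, -1.1
for x in (0.0, 0.25, 0.5, 0.75, 1.0):
    difference = energy_direct(x, P, U_AA, U_AB, U_BB) - energy_split(x, P, U_AA, U_AB, U_BB)
    assert abs(difference) < 1e-12
```

With the A-B bond weaker (less negative) than the average of the A-A and B-B bonds, `mixing_energy` is positive for all 0 < x < 1 and vanishes at both edges of the composition range.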
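A solubility gap occurs whenever $d^2f/dx^2 < 0$, that is, when 


$$\frac{d^2u}{dx^2} = \frac{d^2u_M}{dx^2} < \frac{\tau}{N}\frac{d^2\sigma_M}{dx^2} . \tag{19}$$

From (18), 

$$\frac{d^2u_M}{dx^2} = -2p[u_{AB} - \tfrac{1}{2}(u_{AA} + u_{BB})] . \tag{20}$$

From (12), 

$$\frac{\tau}{N}\frac{d^2\sigma_M}{dx^2} = -\tau\left(\frac{1}{1 - x} + \frac{1}{x}\right) = -\frac{\tau}{x(1 - x)} \le -4\tau . \tag{21}$$

The equal sign holds for $x = \tfrac{1}{2}$. With these results (19) yields 

$$\tau_M = \tfrac{1}{2}p[u_{AB} - \tfrac{1}{2}(u_{AA} + u_{BB})] \tag{22}$$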


as the temperature below which there will be a solubility gap. 
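
The criterion can be illustrated numerically. The sketch below assumes the parabolic mixing energy of (18) and the mixing entropy of (12); the shorthand `w` for $p[u_{AB} - \tfrac{1}{2}(u_{AA} + u_{BB})]$ and the function names are introduced here for illustration only.

```python
def d2f_dx2(x, tau, w):
    """Second derivative of the free energy per atom for the parabolic model.

    Combines d2u_M/dx2 = -2w with -(tau/N) d2sigma_M/dx2 = +tau / (x (1 - x)).
    """
    return -2.0 * w + tau / (x * (1.0 - x))


def tau_max(p, u_aa, u_ab, u_bb):
    """Highest temperature with a solubility gap, Eq. (22)."""
    return 0.5 * p * (u_ab - 0.5 * (u_aa + u_bb))


def has_gap(tau, w, n=1000):
    """True if d2f/dx2 < 0 somewhere on a grid of compositions that includes x = 1/2."""
    compositions = [i / n for i in range(1, n)]
    return any(d2f_dx2(x, tau, w) < 0 for x in compositions)
```

For example, with `w = 0.4` (arbitrary units) the threshold is `w / 2 = 0.2`: `has_gap(0.19, 0.4)` is true and `has_gap(0.21, 0.4)` is false, because the curvature is most negative at x = 1/2.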

There are many reasons why mixed bonds may be weaker than the bonds of the separate 
constituents. If the constituent atoms of an alloy differ in radius, the difference introduces 
elastic strains that raise the energy. Water and oil “do not mix” because water molecules 
carry a large electric dipole moment that leads to a strong electrostatic attraction between 
water molecules. This attraction is absent in water-oil bonds, which are only about as 
strong as the weaker oil-oil bonds. 


Example: Mixture of two solids with different crystal structures. Consider a homoge- 
neous crystalline mixture of gold and silicon. The stable crystal structure of gold is the face- 
centered cubic structure in which every atom is surrounded by twelve equidistant nearest 
neighbors. The stable crystal structure of silicon is the diamond structure in which every 
atom is surrounded by only four equidistant nearest neighbors. If in pure Au we replace a 
small fraction $x$ of the atoms by Si, we obtain a homogeneous mixture Au$_{1-x}$Si$_x$ with the 
fcc crystal structure of Au. Similarly, if in pure Si we replace a small fraction $1 - x$ of the 
atoms by Au, we obtain a homogeneous mixture Au$_{1-x}$Si$_x$, but with the diamond crystal 
structure of Si. There are two different free energies, one for each crystal structure (Figure 
11.6). The two curves must cross somewhere in the composition range, or else pure Au and 
Si would not crystallize in different structures. The equilibrium curve consists of the lower 
of the two curves, with a kink at the crossover point. Such a system exhibits a solubility 
gap on either side of the crossover composition. The curves shown in the figure are sche- 
matic; in the actual Au-Si system the unstable range extends so close to the edges of the 
diagram that it cannot be represented on a full-scale plot extending from x = 0 to x = 1. 
Figure 11.6 Free energy versus composition for crystalline 
homogeneous mixtures for which the two constituents of the 
mixture crystallize in different crystal structures. Two different 
free energy curves are involved, one for each crystal structure. 


Different crystal structures for the pure constituents are an important cause of solubility 
gaps in crystalline solid mixtures. Our argument applies to mixtures of this kind, provided 
the two structures do not transform continuously into each other with changing composi- 
tion. This is a tacit assumption in our discussion, an assumption not always satisfied when 
the two crystal structures are closely similar. The other assumption we make throughout 
this chapter is that no stable compound formation should occur. In the presence of com- 
pound formation the behavior of the mixture may be more complex. 


Example: Liquid $^3$He-$^4$He mixtures at low temperatures. The most interesting liquid 
mixture with a solubility gap is the mixture of the two helium isotopes $^3$He and $^4$He, atoms of 
the former being fermions and of the latter bosons. There is a solubility gap in the mixture 
below 0.87 K, as in Figure 11.7. This property is utilized in the helium dilution refrigerator 
(Chapter 12). The mixing energy must be positive to have a solubility gap. The origin of the 
positive mixing energy is the following: $^4$He atoms are bosons. At sufficiently low 
temperatures almost all $^4$He atoms occupy the ground state orbital of the system, where they have 
zero kinetic energy. Almost the entire kinetic energy of the mixture is contributed by the 
$^3$He atoms, which are fermions. The energy per atom of a degenerate Fermi gas increases 
with concentration as $n^{2/3}$, as in Chapter 7. This energy has a negative second derivative 
$\partial^2\epsilon/\partial n^2 \propto \partial^2 u/\partial x^2$, which by (19) is equivalent to a positive mixing energy. 

Figure 11.7 Liquid mixtures of $^3$He and $^4$He. 


Phase Diagrams for Simple Solubility Gaps 


A phase diagram represents the temperature dependence of solubility gaps, as in 
Figure 11.8. The two compositions $x_\alpha$ and $x_\beta$ are plotted horizontally, the 
corresponding temperature vertically. The $x_\alpha$ and $x_\beta$ branches merge at the 
maximum temperature $\tau_M$ for which a solubility gap exists. At a given temperature 
any mixture whose overall composition falls within the range enclosed by 
the curve is unstable as a homogeneous mixture. The phase diagrams of actual 
mixtures with solubility gaps may be more complex, according to the actual 
form of the free energy relation f(x), but the underlying principles are the same. 
Figure 11.8 Phase diagram for a binary system with a solubility 
gap. A homogeneous mixture of composition $x$ will be unstable at 
temperature $\tau$ if the point $(x, \tau)$ falls below the stability boundary 
curve. The mixture will then form two separate phases of the 
compositions given by the intersections of the stability boundary 
curve with the horizontal line for temperature $\tau$. The stability 
boundary curve shown here was calculated quantitatively for the 
system of Figure 11.5, with a parabolic fo(x). 
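
A stability boundary of this kind can be computed for the parabolic mixing energy of (18). The sketch below is an illustration under stated assumptions, not the calculation used for the figure: it assumes the symmetric model, for which the common tangent touches the mixing part of the free energy at its two minima, so that the boundary follows from setting its first derivative to zero. The function names are hypothetical.

```python
import math


def boundary_temperature(x, tau_m):
    """Temperature at which composition x sits on the stability boundary."""
    if abs(x - 0.5) < 1e-9:
        return tau_m  # limiting value where the two branches merge
    return 2.0 * tau_m * (1.0 - 2.0 * x) / math.log((1.0 - x) / x)


def gap_compositions(tau, tau_m, tol=1e-12):
    """Return (x_alpha, x_beta) at temperature tau, or None at or above tau_m."""
    if tau <= 0:
        raise ValueError("tau must be positive")
    if tau >= tau_m:
        return None
    w = 2.0 * tau_m  # w = p[u_AB - (u_AA + u_BB)/2], from Eq. (22)

    def slope(x):
        # df/dx for f = w x (1 - x) + tau [x log x + (1 - x) log(1 - x)]
        return w * (1.0 - 2.0 * x) + tau * math.log(x / (1.0 - x))

    lo, hi = 1e-300, 0.5 - 1e-6
    if slope(lo) >= 0 or slope(hi) <= 0:
        raise ValueError("root not bracketed; tau too close to 0 or to tau_m")
    while hi - lo > tol:  # plain bisection on the lower branch
        mid = 0.5 * (lo + hi)
        if slope(mid) < 0:
            lo = mid
        else:
            hi = mid
    x_alpha = 0.5 * (lo + hi)
    return x_alpha, 1.0 - x_alpha  # the model is symmetric about x = 1/2
```

At half the maximum temperature, `gap_compositions(0.5, 1.0)` gives a lower branch near x = 0.021, so the gap already spans most of the composition range.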


PHASE EQUILIBRIA BETWEEN LIQUID 
AND SOLID MIXTURES 


When a small fraction of a homogeneous liquid mixture freezes, the composition 
of the solid that forms is almost always different from that of the liquid. The 
phenomenon is readily understood from the free energies for liquid and solid 
mixtures. We consider a simple model, under two assumptions: (a) Neither the 
solid nor the liquid has a solubility gap. (b) The melting temperature $\tau_A$ of pure 
constituent A is lower than the melting temperature $\tau_B$ of pure constituent B. 
We consider a temperature between $\tau_A$ and $\tau_B$. 

The free energies per atom, $f_S(x)$ for the solid and $f_L(x)$ for the liquid, are 
shown qualitatively in Figure 11.9a. The two curves intersect at some composition. 
Let us draw a tangent common to both curves, touching $f_S$ at $x = x_S$ 
and $f_L$ at $x = x_L$. We can define three composition ranges, each with different 
internal equilibria: 

(a) When $x < x_L$, the system in equilibrium is a homogeneous liquid. 

(b) When $x_L < x < x_S$, the system in equilibrium consists of two phases, a 
solid phase of composition $x_S$ and a liquid phase of composition $x_L$. 

(c) When $x > x_S$, the system in equilibrium is a homogeneous solid. 

The compositions $x_S$ and $x_L$ of a solid and a liquid phase in equilibrium are 
temperature dependent. As the temperature decreases the free energy of the 
solid decreases more rapidly than that of the liquid. The tangential points in 
Figure 11.9a move to the left. This behavior is represented by a phase diagram 
similar to the earlier representation of the equilibrium composition curves for 
mixtures with phase separation. In Figure 11.9b the curve for $x_L$ is called the 
liquidus curve; the curve for $x_S$ is the solidus curve. 
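
The common tangent construction has a closed form in one simple case. The sketch below assumes an ideal-solution model that the text does not specify: both phases have only the ideal mixing entropy, and the solid-minus-liquid free energy difference of each pure constituent is taken as $\ell_i(\tau/\tau_i - 1)$ with a temperature-independent latent heat $\ell_i$ per atom. Equal chemical potentials of A and of B in the two phases then fix $x_L$ and $x_S$. All names and parameter values are hypothetical.

```python
import math


def liquidus_solidus(tau, tau_a, tau_b, latent_a, latent_b):
    """Return (x_L, x_S) for tau_a < tau < tau_b in the ideal-solution sketch.

    x is the fraction of the higher-melting constituent B.
    """
    if not (tau_a < tau < tau_b):
        raise ValueError("need tau_a < tau < tau_b")
    # Pure-constituent free energy differences, solid minus liquid, per atom.
    delta_a = latent_a * (tau / tau_a - 1.0)  # positive: pure A is liquid here
    delta_b = latent_b * (tau / tau_b - 1.0)  # negative: pure B is solid here
    a = math.exp(-delta_a / tau)  # equals (1 - x_S) / (1 - x_L)
    b = math.exp(-delta_b / tau)  # equals x_S / x_L
    x_l = (1.0 - a) / (b - a)
    x_s = b * x_l
    return x_l, x_s
```

In this sketch the solid is always richer than the liquid in the higher-melting constituent, and both compositions move to lower x as the temperature is lowered, in line with the behavior described above.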

The phase diagrams have been determined experimentally for vast numbers 
of binary mixtures. Those for most of the possible binary alloys are known.* 
For most metal alloys the phase diagrams are more complicated than Figure 
11.9b, which was drawn for a simple system, germanium-silicon. 

When the temperature is lowered in a binary liquid mixture with the phase 
diagram of Figure 11.9b, solidification takes place over a finite temperature 
range, not just at a fixed temperature. To see this, consider a liquid with the 
initial composition $x_{iL}$, shown in Figure 11.10. As the temperature is lowered, 
solidification begins at $\tau = \tau_i$. The composition of the solid formed is given 
by $x_{iS}$, so that the composition of the remaining liquid is changed. In the 
example $x_{iS} > x_{iL}$, so that the liquid moves towards lower values of $x$, where the 
solidification temperature is lower. The temperature has to be lowered if 
solidification is to continue. The composition of the liquid moves along the 
liquidus curve until the solidification is completed at $\tau = \tau_A$. The solid formed 
is nonuniform in composition and is not in equilibrium. The solid may homoge- 
nize afterward by atomic diffusion, particularly if the temperature remains 
high for a long time. But for many solids atomic diffusion is too slow, and the 
inhomogeneity remains “frozen in” indefinitely. 


* The standard tabulations are by M. Hansen, Constitution of binary alloys, McGraw-Hill, 1958; 
R. P. Elliott, Constitution of binary alloys, first supplement, McGraw-Hill, 1965; F. A. Shunk, 
Constitution of binary alloys, second supplement, McGraw-Hill, 1969. 
Figure 11.9 Phase equilibrium between liquid and solid mixtures. In 
this example neither phase exhibits a solubility gap. We assume 
$\tau_A < \tau < \tau_B$. The upper figure (a) shows the free energies for the two phases; 
the lower figure (b) shows the corresponding phase diagram. The curves $x_L$ 
and $x_S$ in the phase diagram are called the liquidus and the solidus curves. 
The phase diagram is the Ge-Si phase diagram, with $T_{Ge} = 940$°C and 
$T_{Si} = 1412$°C. 
Figure 11.10 Most liquid mixtures do not solidify at a sharp 
temperature, but over a finite temperature range from $\tau_i$ to $\tau_A$. The 
higher-melting constituent precipitates first, thereby enriching the 
lower-melting constituent in the liquid phase and thus lowering the 
solidification temperature of the liquid. 


Advanced Treatment: Eutectics. There are many binary systems in which the 
liquid phase remains a liquid down to temperatures significantly below the 
lower melting temperature of the constituents. The gold-silicon alloy is such a 
system: a mixture of 69 pct Au and 31 pct Si starts to solidify at 370°C. At other 
compositions solidification starts at a higher temperature. When we plot the 
temperature of the onset of solidification as a function of alloy composition, 
we obtain the two-branch liquidus curve in Figure 11.11. Mixtures with two 
liquidus branches are called eutectics. The minimum solidification temperature 
is the eutectic temperature, where the composition is the eutectic composition. 

The solidified solid at the eutectic composition is a two phase solid, with 
nearly pure gold side by side with nearly pure silicon, as in Figure 11.1. In the 
solid Au-Si mixture there is a very wide solubility gap. The low melting point 
occurs for the eutectic composition because the free energy of the homogeneous 
melt is lower than the free energy of the two phase solid, for temperatures at 
or above the eutectic temperature. 

Figure 11.11 Eutectic phase diagram of gold-silicon alloys. The liquidus consists of two 
branches that come together at the eutectic temperature $T_e = 370$°C. The horizontal 
line and the experimental data points at 370°C indicate that throughout the entire 
composition range the mixture does not complete its solidification until the eutectic 
temperature is reached. 


Such behavior is common among systems that exhibit a solubility gap in the 
solid but not in the liquid. The behavior of eutectics can be understood from the 
free energy plots in Figure 11.12a. We assume $f(x)$ for the solid as in Figure 11.6, 
corresponding to different crystal structures $\alpha$ and $\beta$ for the two pure 
constituents. Figure 11.12a is constructed for a temperature above the eutectic 
temperature but below the melting temperature of either constituent, so that 
the free energy of the liquid reaches below the common tangent to the solid 
phase curves. We can draw two new two-point tangents that give even lower free 
energies. We now distinguish five different composition ranges: 

(a) and (e). For $x < x_{\alpha S}$ or $x > x_{\beta S}$, the equilibrium state of the system is a 
homogeneous solid. In the first range the solid will have the crystal structure $\alpha$; 
in the second range the structure is $\beta$. 

(c). For $x_{\alpha L} < x < x_{\beta L}$, the equilibrium state is a homogeneous liquid. 

(b) and (d). For $x_{\alpha S} < x < x_{\alpha L}$ or $x_{\beta L} < x < x_{\beta S}$, a liquid phase is in 
equilibrium with a solid phase. 
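
The five composition ranges can be written as a small lookup. This is an illustrative sketch: the function name and labels are hypothetical, and the liquid fraction in the two-phase ranges is an addition that follows from conservation of B atoms between the two coexisting phases (it is not stated in the text above).

```python
def eutectic_state(x, x_alpha_s, x_alpha_l, x_beta_l, x_beta_s):
    """Return (label, liquid_fraction) for overall composition x.

    The four boundary compositions are the tangent points at one fixed
    temperature above the eutectic temperature.
    """
    if not (0.0 <= x_alpha_s < x_alpha_l < x_beta_l < x_beta_s <= 1.0):
        raise ValueError("tangent points must be ordered along the x axis")
    if not 0.0 <= x <= 1.0:
        raise ValueError("x must lie between 0 and 1")
    if x < x_alpha_s:  # range (a)
        return "solid alpha", 0.0
    if x < x_alpha_l:  # range (b): alpha solid in equilibrium with liquid
        return "solid alpha + liquid", (x - x_alpha_s) / (x_alpha_l - x_alpha_s)
    if x <= x_beta_l:  # range (c)
        return "liquid", 1.0
    if x <= x_beta_s:  # range (d): liquid in equilibrium with beta solid
        return "liquid + solid beta", (x_beta_s - x) / (x_beta_s - x_beta_l)
    return "solid beta", 0.0  # range (e)
```

As the temperature is lowered toward the eutectic temperature, the two liquid tangent points approach each other and range (c) shrinks to a single composition.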

As the temperature is lowered, $f_{\alpha S}$ and $f_{\beta S}$ decrease more rapidly than $f_L$, 
and the range of the homogeneous liquid becomes narrower. Figure 11.12b 
shows the corresponding phase diagram, including the two solidus curves. 
Figure 11.12 Free energies (a) and phase diagram (b) in a simple eutectic 
system. 


At the eutectic temperature $\tau_e$ the free energy of the liquid phase is tangential 
to the common tangent to $f_{\alpha S}$ and $f_{\beta S}$, as in Figure 11.13. The composition at 
which $f_L$ touches the tangent is the eutectic composition. At $\tau < \tau_e$, the free 
energy $f_L$ lies above the tangent, although $f_L$ may be below the free energy of 
a homogeneous solid. 

A mixture of composition equal to the eutectic composition solidifies and 
melts at a single temperature, just like a pure substance. The solidification of 
compositions away from the eutectic composition starts at a higher temperature 
and ends at the eutectic temperature. Melting starts at the eutectic temperature 
and ends at a higher temperature. 

Figure 11.13 Free energies in a eutectic system at $\tau = \tau_e$ and at $\tau < \tau_e$. 

The minimum property of the melting temperature of eutectics is widely 
utilized. The Au-Si eutectic plays a large role in semiconductor device tech- 
nology: the eutectic permits low temperature welding of electrical contact 
wires made of gold to silicon devices. Lead-tin alloys exhibit a eutectic (Figure 
11.14) at 183°C to give solder a melting temperature below that of pure tin, 
232°C. According to whether a sharp melting temperature or a melting range is 
desired, either the exact eutectic composition (26 pct lead) or a different 
composition is employed. Salt sprinkled on ice melts the ice because of the low 
eutectic temperature $-21.2$°C of the H$_2$O-NaCl eutectic at 8.17 mol pct NaCl. 

The solidus curves of eutectic systems vary greatly in character. For the 
Pb-Sn system (Figure 11.14) the solid phases in equilibrium with the melt 
contain an appreciable fraction of the minority constituent, and this fraction 
increases with decreasing temperature. In other systems this fraction may be 
small or may decrease with decreasing temperature, or both. The Au-Si system 
is an example: the relative concentration of Au in solid Si in equilibrium with 
an Au-Si melt reaches a maximum value of only $2 \times 10^{-6}$ around 1300°C, and 
it drops off rapidly at lower temperature. 

In our discussion of the free energy curves of Figures 11.12 and 11.13 we 
assumed that the composition at which the liquid phase free energy touches the 
tangent to the solid phase curves lies between the compositions $x_{\alpha S}$ and $x_{\beta S}$. 
In some systems this point lies outside the interval, as if either $f_{\alpha S}$ and $f_L$ or $f_{\beta S}$ 
and $f_L$ were interchanged in Figure 11.12a. Such systems are called peritectic 
systems. 

Figure 11.14 (a) Phase diagram of the Pb-Sn system, after Hansen. (b) Microphotograph 
of the Pb-Sn eutectic, magnified about 800 times. Courtesy of J. D. Hunt and K. A. 
Jackson. 


SUMMARY 


1. A mixture exhibits a solubility gap when the combined free energy of two 
separate phases side by side is lower than the free energy of the homogeneous 
mixture. 


2. The mixing entropy arises when atoms of different species are interchanged 
in position. For the alloy A$_{1-x}$B$_x$, we have 

$$\sigma_M = -N[(1 - x)\log(1 - x) + x\log x] .$$

3. The mixing energy for nearest-neighbor interactions is 

$$u_M = p x(1 - x)[u_{AB} - \tfrac{1}{2}(u_{AA} + u_{BB})] ,$$

for $p$ nearest neighbors. 


4. The liquidus is the composition curve $x_L$ versus $\tau$ for a liquid phase in 
equilibrium with a solid. The solidus is the composition curve $x_S$ versus $\tau$ 
for a solid phase in equilibrium with a liquid. 


5. Mixtures with two branches to the liquidus curve are called eutectics. The 
minimum solidification temperature is called the eutectic temperature. 


PROBLEMS 


1. Chemical potentials in two-phase equilibrium. Show that the chemical 
potentials $\mu_A$ and $\mu_B$ of the two atomic species A and B of an equilibrium two 
phase mixture are given by the intercepts of the two-point tangent in Figure 11.2 
with the vertical edges of the diagram at $x = 0$ and $x = 1$. 


2. Mixing energy in $^3$He-$^4$He and Pb-Sn mixtures. The phase diagram of liquid 
$^3$He-$^4$He mixtures in Figure 11.7 shows that the solubility of $^3$He in $^4$He 
remains finite (about 6 pct) as $\tau \to 0$. Similarly, the Pb-Sn phase diagram of 
Figure 11.14 shows a finite residual solubility of Pb in solid Sn with decreasing 
$\tau$. What do such finite residual solubilities imply about the form of the function 
$u(x)$? 


3. Segregation coefficient of impurities. Let B be an impurity in A, with $x \ll 1$. 
In this limit the non-mixing parts of the free energy can be expressed as linear 
functions of $x$, as $f_0(x) = f_0(0) + x f_0'(0)$, for both liquid and solid phases. 
Assume that the liquid mixture is in equilibrium with the solid mixture. Calculate 
the equilibrium concentration ratio $k = x_S/x_L$, called the segregation coefficient. 
For many systems $k \ll 1$, and then a substance may be purified by melting 
and partial resolidification, discarding a small fraction of the melt. This principle 
is widely used in the purification of materials, as in the zone refining of 
semiconductors. Give a numerical value for $k$ for $f_{0S}' - f_{0L}' = 1$ eV and $T = 1000$ K. 


4. Solidification range of a binary alloy. Consider the solidification of a binary 
alloy with the phase diagram of Figure 11.10. Show that, regardless of the 
initial composition, the melt will always become fully depleted in component B 
by the time the last remnant of the melt solidifies. That is, the solidification 
will not be complete until the temperature has dropped to $\tau_A$. 


5. Alloying of gold into silicon. (a) Suppose a 1000 Å layer of Au is evaporated 
onto a Si crystal, and subsequently heated to 400°C. From the Au-Si phase 
diagram, Figure 11.11, estimate how deep the gold will penetrate into the 
silicon crystal. The densities of Au and Si are 19.3 and 2.33 g cm$^{-3}$. (b) Redo 
the estimate for 800°C. 
Cryogenics is the physics and technology of the production of low temperatures. 
We discuss the physical principles of the most important cooling methods, 
down to the lowest temperatures. 

The dominant principle of low temperature generation down to 10 mK is the 
cooling of a gas by letting it do work against a force during an expansion. The 
gas employed may be a conventional gas; the free electron gas in a 
semiconductor; or the virtual gas of $^3$He atoms dissolved in liquid $^4$He. The force 
against which work is done may be external or internal to the gas. Below 
10 mK the dominant cooling principle is the isentropic demagnetization of a 
paramagnetic substance. 

We discuss the cooling methods in the order in which they occur in a 
laboratory cooling chain that starts by liquefying helium and proceeds from 
there to the lowest laboratory temperatures, usually 10 mK, sometimes 1 μK. 
Household cooling appliances and automobile air conditioners utilize the 
same evaporation cooling method that is used in the laboratory for cooling 
liquid helium below its boiling temperature, to about 1 K. 


COOLING BY EXTERNAL WORK 
IN AN EXPANSION ENGINE 


In the isentropic expansion of a monatomic ideal gas from pressure $p_1$ to a 
lower pressure $p_2$, the temperature drops according to 


$$T_2 = T_1 (p_2/p_1)^{2/5} , \tag{1}$$


by (6.64). Suppose $p_1 = 32$ atm; $p_2 = 1$ atm; and $T_1 = 300$ K; then the 
temperature will drop to $T_2 = 75$ K. We are chiefly interested in helium as the 
working gas, and for helium (1) is an excellent approximation if the cooling 
process is reversible. 
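
The numerical example can be reproduced directly from (1). This is a minimal sketch for a monatomic ideal gas only; the function name is hypothetical.

```python
def isentropic_final_temperature(t1, p1, p2):
    """Eq. (1) for a monatomic ideal gas: T2 = T1 (p2/p1)^(2/5).

    t1 is in kelvin; p1 and p2 may be in any common pressure unit.
    """
    if t1 <= 0 or p1 <= 0 or p2 <= 0:
        raise ValueError("temperature and pressures must be positive")
    return t1 * (p2 / p1) ** (2.0 / 5.0)


# Expansion from 32 atm to 1 atm starting at 300 K, as in the text.
t2 = isentropic_final_temperature(300.0, 32.0, 1.0)
print(round(t2, 3))
```

Since $32 = 2^5$, the pressure ratio raised to the power 2/5 is exactly 1/4, which is why the temperature falls to one quarter of its initial value.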

The problems in implementing expansion cooling arise from the partial 
irreversibility of actual expansion processes. The problems are compounded 
by the nonexistence of good low temperature lubricants. Actual expansion 
cooling cycles follow Figure 12.1. The compression and expansion parts of 
the cycle are separated. The compression is performed at or above room 
temperature. The hot compressed gas is cooled to near room temperature 
by ejecting heat into the environment. The gas is further precooled in a 
counterflow heat exchanger by contact with the cold return gas stream at the low 
temperature of the cooling load. The gas is then cooled to its lowest temperature 
in the expansion engine, usually a low friction turbine. The cold gas extracts 
heat from the cooling load and then returns to the compressor via the heat 
exchanger. The heat exchanger greatly reduces the cooling requirements 
imposed on the expansion engine. The design of the heat exchanger is as important 
as the design of the expansion engine. 

Figure 12.1 Simple expansion refrigerator. A working gas is 
compressed; the heat of compression is ejected into the 
environment. The compressed room temperature gas is 
precooled further in the counterflow heat exchanger. It then 
does work in an expansion engine, where it cools to a 
temperature below that of the working volume. After extracting 
heat from the working volume, the gas returns to the compressor 
via the heat exchanger. 

The work extracted by the expansion engine is the enthalpy difference 
between the input and output gas. The total energy flowing into the expansion 
engine is the internal energy $U_1$ of the gas plus the displacement work $p_1V_1$ 
done by the compressor, where both $U_1$ and $V_1$ refer to a given mass of gas. 
The total energy leaving the engine with the gas is the energy $U_2$ of the gas 
plus the work $p_2V_2$ required to move the gas against the pressure $p_2$. The work 
extracted by the engine is the difference 


$$W = (U_1 + p_1V_1) - (U_2 + p_2V_2) = H_1 - H_2 . \tag{2}$$


For a monatomic ideal gas $U = \tfrac{3}{2}N\tau$ and $pV = N\tau$, hence $H = \tfrac{5}{2}N\tau$. The 
work performed on the engine by the gas is 


$$W = \tfrac{5}{2}N(\tau_1 - \tau_2) . \tag{3}$$


The counterflow heat exchanger is an enthalpy exchange device: it is an 
expansion engine which extracts no external work. 
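
To get a feeling for magnitudes, (3) can be evaluated per mole in conventional units, where $N\tau$ for one mole becomes $RT$. The sketch below assumes a monatomic ideal gas and uses the 300 K to 75 K expansion of the earlier example; the function name is hypothetical.

```python
R_GAS = 8.314462618  # molar gas constant, J/(mol K)


def expansion_work_per_mole(t1, t2):
    """Eq. (3) per mole in conventional units: W = (5/2) R (T1 - T2), in joules."""
    return 2.5 * R_GAS * (t1 - t2)


# Work delivered to the engine by one mole of gas cooling from 300 K to 75 K.
work = expansion_work_per_mole(300.0, 75.0)
print(f"{work:.0f} J/mol")
```

The result is a few kilojoules per mole, which the engine must deliver to the outside for the cooling to take place.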

Most gas liquefiers use expansion engines to precool the gas close to its 
liquefaction temperature. It is impractical to carry the expansion cooling to 
the point of liquefaction: the formation of a liquid phase inside expansion 
engines causes mechanical operating difficulties. The final liquefaction stage 
is usually a Joule-Thomson stage, discussed below. Helium and hydrogen 
liquefiers usually contain two or more expansion engines at successive tem- 
peratures, with multiple heat exchangers. 

The principle of cooling by isentropic expansion of an ideal gas is applicable 
to the electron gas in semiconductors. When electrons flow from a semi- 
conductor with high electron concentration into a semiconductor with a lower 
electron concentration, the electron gas expands and does work against the 
potential barrier between the two substances that equalizes the two chemical 
potentials. The resulting electronic cooling, called the Peltier effect, is used 
down to about 195 K quite routinely; in multistage units temperatures down 
to 135 K have been achieved. 


Gas Liquefaction by the Joule-Thomson Effect 


Intermolecular attractive interactions cause the condensation of all gases. At 
temperatures slightly above the condensation temperature the interactions are 
strong enough that work against them during expansion causes significant 
cooling of the gas. If the cooling is sufficient, part of the gas will condense. 
This process is Joule-Thomson liquefaction. 

The practical implementation is simple. Gas at pressure $p_1$ is forced through 
a constriction called an expansion valve into space with a lower pressure $p_2$, 
as in Figure 12.2. The work is the difference between the displacement work 
$-p_1 dV_1$ done on the gas in pushing it through the expansion valve and the 
displacement work $+p_2 dV_2$ recovered from the gas on the downstream side. 
Here $dV_1$ is negative and $dV_2$ is positive. 

The overall process is at constant enthalpy. To see this, notice that the 
expansion valve acts as an expansion engine that extracts zero work. With 
$W = 0$ in (2), we have $H_1 = H_2$ in the Joule-Thomson effect. For an ideal gas 
$H = \tfrac{5}{2}N\tau$, so that $\tau_1 = \tau_2$ in the expansion. There is zero cooling effect for an 
ideal gas. 

In real gases a small temperature change occurs because of the internal 
work done by the molecules during expansion. The sign of the temperature 
change during a Joule-Thomson expansion depends on the initial temperature. 
All gases have an inversion temperature $\tau_{inv}$ below which such an expansion 
cools, above which it heats the gas. Inversion temperatures for common gases 
are listed in Table 12.1. 


Figure 12.2 The Joule-Thomson effect. A gas is pushed 
through an expansion valve. If the gas is nonideal, there will 
be a temperature change during the expansion because of work 
done against the intermolecular forces. If the temperature is 
initially below a certain inversion temperature, $\tau_{inv}$, the gas 
will cool on Joule-Thomson expansion. 


Table 12.1 Liquefaction data for low boiling gases 

Gas        $T_b$    $T_c$    $T_{inv}$   $\Delta H$   $V_l$       $\Delta H/V_l$ 
           K        K        K           kJ/mol       cm$^3$/mol  watt hr/liter 
CO$_2$     195      304      (2050)      25.2         22.3        314 
CH$_4$     112      191      (1290)      8.18         34.4        66 
O$_2$      90.2     155      893         6.82         28.1        67 
N$_2$      77.3     126      621         5.57         34.6        45 
H$_2$      20.4     33.3     205         0.90         28.6        8.7 
$^4$He     4.18     5.25     51          0.082        32.0        0.71 
$^3$He     3.20     3.35     (23)        0.025        50.8        0.14 


NOTE: $T_b$ = atmospheric-pressure boiling temperature; $T_c$ = critical temperature; 
$T_{inv}$ = Joule-Thomson inversion temperature; $\Delta H$ = molar latent heat of 
vaporization; $V_l$ = molar volume of the liquid. The last column, $\Delta H/V_l$, indicates 
the heat in watts that can be taken up for a refrigerant consumption of 1 liter per hour; 
$T_{inv}$ values in parentheses are van der Waals values calculated from $T_c$ and not 
measured values. 

Carbon dioxide solidifies when cooled at atmospheric pressure, because its triple point occurs 
above atmospheric pressure. Solid CO$_2$ is known as dry ice. Methane, CH$_4$, is the principal 
constituent of natural gas, which is liquefied in huge quantities for shipping as LNG fuel. Liquid 
oxygen and nitrogen are separated in the liquefaction of air. For helium, we give data both for the 
common isotope $^4$He and for $^3$He. 
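
The last column of Table 12.1 is a unit conversion of the two columns before it, which gives a quick consistency check on the tabulated values. The sketch below is illustrative; the dictionary layout and function name are hypothetical, and the numbers are copied from the table.

```python
# (latent heat in kJ/mol, liquid molar volume in cm^3/mol, tabulated watt hr/liter)
TABLE_12_1 = {
    "CO2": (25.2, 22.3, 314.0),
    "CH4": (8.18, 34.4, 66.0),
    "O2": (6.82, 28.1, 67.0),
    "N2": (5.57, 34.6, 45.0),
    "H2": (0.90, 28.6, 8.7),
    "4He": (0.082, 32.0, 0.71),
    "3He": (0.025, 50.8, 0.14),
}


def heat_per_liter_wh(delta_h_kj_per_mol, v_liquid_cm3_per_mol):
    """Latent heat per liter of liquid, in watt hours per liter."""
    joule_per_cm3 = delta_h_kj_per_mol * 1000.0 / v_liquid_cm3_per_mol
    joule_per_liter = joule_per_cm3 * 1000.0  # 1000 cm^3 per liter
    return joule_per_liter / 3600.0  # 3600 J per watt hour


for gas, (delta_h, v_liquid, tabulated) in TABLE_12_1.items():
    computed = heat_per_liter_wh(delta_h, v_liquid)
    print(f"{gas:>4}: computed {computed:8.3f}, tabulated {tabulated}")
```

The computed values agree with the tabulated column to within rounding, and the comparison shows how little heat a liter of liquid helium can absorb compared with the other refrigerants.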


Example: Joule-Thomson effect for van der Waals gas. We found in (10.75) that 

$$H = \tfrac{5}{2}N\tau + (N^2/V)(b\tau - 2a) \tag{4}$$


for a van der Waals gas, where $a$ and $b$ are positive constants. The last two terms are the 
corrections caused by the short range repulsion and the long range attraction. The 
corrections have opposite signs. The total correction changes sign at the temperature 


$$\tau_{inv} = 2a/b = \tfrac{27}{4}\tau_c , \tag{5}$$


where $\tau_c$ is the critical temperature, defined by (10.46). 

The temperature $\tau_{inv}$ is the inversion temperature. For $\tau < \tau_{inv}$ the enthalpy at fixed 
temperature increases as the volume increases; here in expansion the work done against the 
attractive interactions between molecules is dominant. In a process at constant enthalpy 
this increase is compensated by a decrease of the $\tfrac{5}{2}N\tau$ term, that is, by cooling the gas. For 
$\tau > \tau_{inv}$ the enthalpy at a fixed temperature decreases as the volume increases, because now 
the work done by the strong short range repulsive interactions is dominant: at the higher 
temperature the molecules penetrate farther into the repulsive region. 
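
The van der Waals relations (4) and (5) are easy to explore numerically. The sketch below is illustrative only: the function names are hypothetical, the constants $a$ and $b$ in the usage are arbitrary, and the critical temperatures are those listed in Table 12.1.

```python
def vdw_enthalpy(n, tau, v, a, b):
    """Enthalpy of a van der Waals gas, Eq. (4), in units where tau is an energy."""
    return 2.5 * n * tau + (n ** 2 / v) * (b * tau - 2.0 * a)


def inversion_temperature(a, b):
    """Temperature at which the correction term in Eq. (4) changes sign, Eq. (5)."""
    return 2.0 * a / b


def inversion_from_critical(t_c):
    """Van der Waals estimate: inversion temperature = (27/4) x critical temperature."""
    return 6.75 * t_c


# Van der Waals estimates for the parenthesized entries of Table 12.1 (kelvin).
for gas, t_c in (("CO2", 304.0), ("CH4", 191.0), ("3He", 3.35)):
    print(gas, round(inversion_from_critical(t_c), 1))
```

Below the inversion temperature `vdw_enthalpy` at fixed `tau` grows with `v`, so a constant enthalpy expansion must lower `tau`; above it the trend reverses. The estimates from the critical temperatures reproduce the parenthesized table entries to within rounding.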


Linde cycle. In gas liquefiers the Joule-Thomson expansion is combined with 
a counterflow heat exchanger, as shown in Figure 12.3. The combination is 
called a Linde cycle, after Carl von Linde who used such a cycle in 1895 to 
liquefy air starting from room temperature. In our discussion we assume that 
the expanded gas returning from the heat exchanger is at the same temperature 
as the compressed gas entering it. We neglect any pressure difference between 
the output of the heat exchanger and the pressure above the liquid. 


Figure 12.3 The Linde cycle. Gas is liquefied 
by combining Joule-Thomson expansion with 
a counterflow heat exchanger. (Diagram labels: to and from compressor 
or precooling stages; heat exchanger; JT expansion valve; liquefied gas.) 
Figure 12.4 Performance of helium liquefiers operating by the Linde cycle, 
as a function of the input pressure, for an output pressure of 1 atm and 
for various values of the input temperature. The solid curves give the 
liquefaction coefficient $\lambda$. The broken curves give $Q_{in} = H_{out} - H_{in}$, the 
internal refrigeration load available at 4.2 K if the load is placed inside the 
liquefier and the still cold helium gas boiled off by the load is returned 
through the heat exchanger rather than boiled off into the atmosphere. 
See Problem 3. After A. J. Croft in Advanced cryogenics (C. A. Bailey, ed.), 
Plenum, 1971, p. 187. (Horizontal axis: input pressure, 0 to 40 atm.) 




The combination heat exchanger-expansion valve is a constant enthalpy 
arrangement. Let one mole of gas enter the combination; suppose that the 
fraction $\lambda$ is liquefied. Constant enthalpy requires that 


$$H_{in} = \lambda H_{liq} + (1 - \lambda)H_{out} . \tag{6}$$


Here $H_{in} = H(T_{in}, p_{in})$ and $H_{out} = H(T_{in}, p_{out})$ are the enthalpies per mole of 
gas at the input and output pressures, both at the common upper temperature 
of the heat exchanger. $H_{liq}$ is the enthalpy per mole of liquid at its boiling 
temperature under the pressure $p_{out}$. From (6) we obtain the fraction 


$$\lambda = \frac{H_{out} - H_{in}}{H_{out} - H_{liq}} , \tag{7}$$


called the liquefaction coefficient. 
Liquefaction takes place when $H_{out} > H_{in}$; that is, when 


$$H(T_{in}, p_{out}) > H(T_{in}, p_{in}) . \tag{8}$$

Only the enthalpies at the input temperature of the heat exchanger matter. If 
the Joule-Thomson expansion at this temperature cools the gas, liquefaction 
will take place. 

The three enthalpies in (7) are known experimentally. Figure 12.4 shows the 
liquefaction coefficient calculated from them for helium. The liquefaction 
coefficient drops rapidly with increasing $T_{in}$, because of the decrease of the 
numerator in (7) and the increase of the denominator. To obtain useful 
liquefaction, say $\lambda > 0.1$, input temperatures below one-third of the inversion 
temperature are usually required. For many gases this requires precooling of 
the gas by an expansion engine. The combination of an expansion engine and 
a Linde cycle is called a Claude cycle. The expansion engine is invariably 
preceded by another heat exchanger, as in Figure 12.1. 
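
The enthalpy balance (6) and the liquefaction coefficient (7) can be expressed as a short function. This is a sketch under stated assumptions: the function name is hypothetical, the enthalpy values in the usage are made-up numbers in arbitrary units rather than measured helium data, and returning zero when (8) fails is a convention chosen here.

```python
def liquefaction_coefficient(h_in, h_out, h_liq):
    """Fraction liquefied per mole of input gas, Eq. (7).

    h_in  : enthalpy per mole of gas at (T_in, p_in)
    h_out : enthalpy per mole of gas at (T_in, p_out)
    h_liq : enthalpy per mole of liquid at its boiling point under p_out
    """
    if h_out <= h_liq:
        raise ValueError("gas enthalpy must exceed liquid enthalpy")
    if h_out <= h_in:
        return 0.0  # condition (8) fails: no liquefaction
    return (h_out - h_in) / (h_out - h_liq)


# Made-up enthalpies, arbitrary units, for illustration only.
lam = liquefaction_coefficient(h_in=90.0, h_out=100.0, h_liq=10.0)
balance = lam * 10.0 + (1.0 - lam) * 100.0  # right-hand side of Eq. (6)
print(lam, balance)
```

In this made-up case about one ninth of the gas is liquefied, and substituting the result back into (6) recovers the input enthalpy.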


Evaporation Cooling: Pumped Helium, to 0.3 K 


Starting from liquid helium, the simplest route to lower temperatures is by 
evaporation cooling of the liquid helium, by pumping away helium vapor. The 
latent heat of vaporization of the liquid helium is extracted along with the vapor. 
The heat extraction causes the further cooling: work is done against the inter- 
atomic forces that caused the helium to liquefy in the first place. In Joule- 
Thomson cooling the initial state is a gas, while in evaporation cooling the 
initial state is a liquid. 
Table 12.2 Temperatures, in kelvin, at which the vapor pressures of $^4$He and 
$^3$He reach specified values 


p (torr)   $10^{-4}$   $10^{-3}$   $10^{-2}$   $10^{-1}$   1      10     100 
$^4$He     0.56        0.66        0.79        0.98        1.27   1.74   2.64 
$^3$He     0.23        0.28        0.36        0.47        0.66   1.03   1.79 


The lowest temperature accessible by evaporation cooling of liquid helium 
is a problem in vacuum technology (Chapter 14). As the temperature drops, the 
equilibrium vapor pressure drops (Table 12.2) and so does the rate at which 
helium gas and its heat of vaporization can be extracted from the liquid helium 
bath. 

Evaporation cooling is the dominant cooling principle in everyday cooling 
devices such as household refrigerators and freezers and in air conditioners. 
The only difference is in the working substance. 


Helium Dilution Refrigerator: Millidegrees 


Once the equilibrium vapor pressure of liquid 3He has dropped to 10^-3 torr,
classical refrigeration principles lose their utility. The temperature range from
0.6 K to 0.01 K is dominated by the helium dilution refrigerator, which is an
evaporation refrigerator in a very clever quantum disguise.*

We saw in Chapter 7 that 4He atoms are bosons, while 3He atoms are
fermions. This distinction is not important at temperatures appreciably higher
than the superfluid transition temperature of 4He, 2.17 K. However, the two
isotopes behave as altogether different substances at lower temperatures. Below
0.87 K liquid 3He and 4He are immiscible over a wide composition range, like
oil and water. This was discussed in Chapter 11 and is shown in the phase
diagram of 3He-4He mixtures in Figure 11.7. A mixture with composition in
the range labeled unstable will decompose into two separate phases whose
compositions are given by the two branches of the curve enclosing that area.
The concentrated 3He phase floats on top of the dilute 3He phase.

As T → 0, the 3He concentration of the phase dilute in 3He drops to about
6 percent, and the phase rich in 3He becomes essentially pure 3He. Consider a liquid


* For good reviews, see D. S. Betts, Contemporary Physics 9, 97 (1968); J. C. Wheatley, Am. J. Phys.
36, 181 (1968); for a general review of cooling techniques below 1 K see W. J. Huiskamp and O. V. 
Lounasmaa, Repts. Prog. Phys. 36, 423 (1973); O. V. Lounasmaa, Experimental principles and 
methods below 1 K, Academic Press, New York, 1974. A very elementary account is O. V. Lounasmaa, 
Scientific American 221, 26 (1969). 




[Figure 12.5: schematic with the labels 3He, "evaporation", and 4He.]

Figure 12.5 Cooling principle of the helium dilution refrigerator. Liquid
3He is in equilibrium with a 3He-4He mixture. When 4He is added to the
mixture, 3He evaporates from the pure 3He fluid and absorbs heat in the
process.


3He-4He mixture with more than 6 percent 3He at a temperature in the millidegree
range, near the bottom of Figure 11.7. At these temperatures almost all the 4He
atoms have condensed into the ground state orbital. Their entropy is negligible
compared to that of the remaining 3He atoms, which then behave as if they were
present alone, as a gas occupying the volume of the mixture. If the 3He
concentration exceeds 6 percent, the excess condenses into concentrated liquid 3He and
latent heat is liberated. If concentrated liquid 3He is evaporated into the
4He-rich phase, the latent heat is consumed. The principle of evaporation cooling
can again be applied: this is the basis of the helium dilution refrigerator.

To see how the solution of 3He can be employed to obtain refrigeration,
consider the equilibrium between the concentrated 3He liquid phase and the
dilute 3He gas-like phase (Figure 12.5). Suppose that the 3He:4He ratio of
the dilute phase is decreased, as by dilution with pure 4He. In order to restore
the equilibrium concentration, 3He atoms will evaporate from the concentrated
3He liquid. Cooling will result.

To obtain a cyclic process the 3He-4He mixture must be separated again.
The most common method is by distillation, using the different equilibrium
vapor pressures of 3He and 4He (Table 12.2). Figure 12.6 shows a schematic
diagram of a refrigerator built on these principles. The diagram is highly
oversimplified. In particular, in actual refrigerators the heat exchanger between
the mixing chamber and the still has an elaborate multistage design. An alternate
method* to separate the 3He-4He mixture utilizes the superfluidity of 4He
below 2.17 K. For a variety of practical reasons it is less commonly used,
although its performance is excellent.


Figure 12.6 Helium dilution refrigerator. Precooled liquid 3He enters a mixing
chamber at the lower end of the assembly, where cooling takes place by the
quasi-evaporation of the 3He atoms into the denser 3He-4He mixed phase underneath.
The quasi-gas of 3He atoms dissolved in liquid 4He then diffuses through a counterflow
heat exchanger into a still. There the 3He is distilled from the 3He-4He mixture
selectively, and is pumped off. To obtain a useful 3He evaporation and circulation
rate, heat must be added to the still, to raise its temperature to about 0.7 K, at which
temperature the 4He vapor pressure is still much smaller. Thus, the 4He does not
circulate to any appreciable extent; the 3He moves through a nearly stationary
background of 4He. The pumped-off 3He is returned to the system and is condensed
in a condenser that is cooled to about 1 K by contact with a pumped 4He bath. The
constriction below the condenser takes up the excess pressure generated by the
circulation pump over the pressure in the still. The liquefied 3He is cooled further,
first in the still, then in the counterflow heat exchanger, before re-entering the mixing
chamber.


The helium dilution refrigerator has a low temperature limit. In the
conventional evaporation refrigerator this limit arose because of the disappearance of
the gas phase, but the quasi-gas phase of 3He persists down to $\tau = 0$. However,
the heat of quasi-vaporization of 3He vanishes proportionally to $\tau^2$, and as a
result, the heat removal rate from the mixing chamber vanishes as $\tau^2$. The
practical low temperature limit is about 10 mK. In one representative device*
a temperature of 8.3 mK has been achieved; the same device was capable of
removing 40 μW at 80 mK.

Temperatures below 8 mK can be achieved by single shot operation. If, in
the design of Figure 12.6, we shut off the 3He supply after some time of operation,
there is no need to cool the incoming 3He itself, and the temperature of the
mixing chamber drops below its steady state value, until all 3He has been
removed from the chamber.

The dilution refrigerator is not the only cooling method in the millikelvin
range that utilizes the peculiar properties of 3He. An alternate method, known
as Pomeranchuk cooling, utilizes the phase diagram of 3He, as shown in
Figure 7.15, with its negative slope of the phase boundary between liquid and
solid 3He. The interested reader is referred to the reviews by Huiskamp and
Lounasmaa, and by Lounasmaa, cited earlier.


* N. H. Pennings, R. de Bruyn Ouboter, K. W. Taconis, Physica B 81, 101 (1976), and Physica B 
84, 102 (1976). 
ISENTROPIC DEMAGNETIZATION:
QUEST FOR ABSOLUTE ZERO 


Below 0.01 K the dominant cooling process is the isentropic (adiabatic) demag- 
netization of a paramagnetic substance. By this process, temperatures of 1 mK 
have been attained with electronic paramagnetic systems and 1 μK with nuclear
paramagnetic systems. The method depends on the fact that at a fixed tempera- 
ture the entropy of a system of magnetic moments is lowered by application of a 
magnetic field—essentially because fewer states are accessible to the system 
when the level splitting is large than when the level splitting is small. Examples 
of the dependence of the entropy on the magnetic field were given in Chapters 2 
and 3. 

We first apply a magnetic field $B_1$ at constant temperature $\tau_1$. The spin excess
will attain a value appropriate to the value of $B_1/\tau_1$. If the magnetic field is then
reduced to $B_2$ without changing the entropy of the spin system, the spin excess
will remain unchanged, which means that $B_2/\tau_2$ will equal $B_1/\tau_1$. If $B_2 \ll B_1$,
then $\tau_2 \ll \tau_1$. When the specimen is demagnetized isentropically, entropy can
flow into the spin system only from the system of lattice vibrations, as in Fig- 
ure 12.7. At the temperatures of interest the entropy of the lattice vibrations is 
usually negligible; thus the entropy of the spin system will be essentially constant 
during isentropic demagnetization of the specimen. 


[Figure 12.7: plots of entropy and of temperature versus time, with a curve labeled
"Lattice"; the time axis marks "Before", "Time at which magnetic field is removed",
and "New equilibrium".]

Figure 12.7 During isentropic demagnetization the total entropy of the
specimen is constant. The initial entropy of the lattice should be small in
comparison with the entropy of the spin system in order to obtain significant
cooling of the lattice.




[Figure 12.8: spin entropy, in units of $N k_B$, versus T in mK; curve label
"B = 0; $B_\Delta$ = 100 gauss".]

Figure 12.8 Entropy for a spin 1/2 system as a function of temperature, assuming
an internal random magnetic field $B_\Delta$ of 100 gauss. The specimen is magnetized
isothermally along ab, and is then insulated thermally. The external magnetic
field is turned off along bc. In order to keep the figure on a reasonable scale
the initial temperature $T_1$ and the external magnetic field are lower than would be
used in practice.


The steps carried out in the cooling process are shown in Figure 12.8. The
field is applied at temperature $\tau_1$ with the specimen in good thermal contact
with the surroundings, giving the isothermal path ab. The specimen is then
insulated ($\Delta\sigma = 0$) and the field removed; the specimen follows the constant
entropy path bc, ending up at temperature $\tau_2$. The thermal contact at $\tau_1$ is
provided by helium gas, and the thermal contact is broken by removing the
gas with a pump.

The population of a magnetic sublevel is a function only of $mB/\tau$, where m
is the magnetic moment of a spin. The spin-system entropy is a function only
of the population distribution; hence the spin entropy is a function only of $mB/\tau$.
If $B_\Delta$ is the effective field that corresponds to the diverse local interactions among
the spins or of the spins with the lattice, the final temperature $\tau_2$ reached in an
isentropic demagnetization experiment is


$$\tau_2 = \tau_1 (B_\Delta / B) , \tag{9}$$


where B is the initial field and $\tau_1$ the initial temperature. Results are shown in
Figure 12.9 for the paramagnetic salt known as CMN, which denotes cerous
magnesium nitrate.
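
The statement that the spin entropy depends only on $mB/\tau$ can be checked numerically. The sketch below assumes the standard two-level (spin 1/2) result for the entropy per spin, $\sigma(x) = \log(2\cosh x) - x\tanh x$ with $x = mB/\tau$; the function name and the sample fields and temperature are illustrative choices in arbitrary units, not values from the text.

```python
import math


def spin_half_entropy(x):
    """Entropy per spin, in units of k_B, for a spin-1/2 moment.

    x = m*B/tau is the only variable on which the entropy depends.
    """
    return math.log(2.0 * math.cosh(x)) - x * math.tanh(x)


# Arbitrary illustrative values (assumptions): moment, fields, temperature.
m = 1.0
tau_1, b_1, b_2 = 1.0, 2.0, 0.5

# Constant entropy requires B_2/tau_2 = B_1/tau_1.
tau_2 = tau_1 * b_2 / b_1

sigma_before = spin_half_entropy(m * b_1 / tau_1)
sigma_after = spin_half_entropy(m * b_2 / tau_2)

# Isothermal magnetization lowers the entropy below its zero-field value log 2.
sigma_zero_field = spin_half_entropy(0.0)
```

Lowering the field fourfold at constant entropy lowers the temperature fourfold, which is the content of (9) with the final field in the role of $B_\Delta$.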
[Figure 12.9: final magnetic field, in gauss, plotted against final temperature, in K.]

Figure 12.9 Final magnetic field $B_f$ versus final
temperature $T_f$ for magnetic cooling of cerous magnesium
nitrate. In these experiments the magnetic field was not
removed entirely, but only to the indicated values. The
initial fields and temperatures were identical in all runs.
After unpublished results of J. S. Hill and J. H. Milner,
as cited by N. Kurti, Nuovo Cimento (Supplemento) 6,
1109 (1957).


The process described so far is a single shot process. It is easily converted 
into a cyclic process by thermally disconnecting, in one way or another, the 
demagnetized working substance from the load, reconnecting it to the reservoir 
at $\tau_1$, and repeating the process.*


Nuclear Demagnetization 


Because nuclear magnetic moments are weak, nuclear magnetic interactions 
are much weaker than similar electronic interactions. We expect to reach a 
temperature 100 times lower with a nuclear paramagnet than with an electron 
paramagnet. The initial temperature of the nuclear stage in a nuclear spin- 


*C. V. Heer, C. B. Barnes, and J. G. Daunt, Rev. Sci. Inst. 25, 1088 (1954); W. P. Pratt, S. S. 
Rosenblum, W. A. Steyert, and J. A. Barclay, Cryogenics 17, 381 (1977). 




[Figure 12.10: final temperature, in μK, plotted against initial magnetic field, in kG,
with a second scale for the initial B/T, in G/K.]

Figure 12.10 Nuclear demagnetizations of copper
nuclei in the metal, starting from 0.012 K and various
fields. After M. V. Hobden and N. Kurti, Phil. Mag.
4, 1902 (1959).


cooling experiment must be lower than in an electron spin-cooling experiment.
If we start at B = 50 kG and $T_1$ = 0.01 K, then $mB/k_B T_1 \approx 0.5$, and the entropy
decrease on magnetization is over 10 percent of the maximum spin entropy.
This is sufficient to overwhelm the lattice and from (9) we estimate a final
temperature $T_2 \approx 10^{-7}$ K. The first nuclear cooling experiment was carried
out by Kurti and coworkers on Cu nuclei in the metal, starting from a first
stage at about 0.02 K as attained by electron demagnetization cooling. The
lowest temperature reached in this experiment was $1.2 \times 10^{-6}$ K. The results
in Figure 12.10 fit a line of the form of (9): $T_2 = T_1(3.1/B)$ with B in gauss, so
that $B_\Delta$ = 3.1 gauss. This is the effective interaction field of the magnetic
moments of the Cu nuclei. The motivation for using nuclei in a metal rather than
in an insulator is that conduction electrons help ensure rapid thermal contact
of lattice and nuclei at the temperature of the first stage.
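
The fitted line $T_2 = T_1(3.1/B)$ is easy to evaluate. The sketch below takes the interaction field of 3.1 gauss and the starting temperature of 0.012 K from the text and Figure 12.10; the initial field of 30 kG is an assumed illustrative value, and the function name is hypothetical.

```python
B_DELTA_GAUSS = 3.1  # effective interaction field of Cu nuclei, from the fit to (9)


def final_temperature_kelvin(t_initial_k, b_initial_gauss, b_delta_gauss=B_DELTA_GAUSS):
    """Final temperature after full demagnetization, eq. (9): T2 = T1 * B_delta / B."""
    if b_initial_gauss <= 0:
        raise ValueError("initial field must be positive")
    return t_initial_k * b_delta_gauss / b_initial_gauss


# Starting temperature 0.012 K as in Figure 12.10; 30 kG is an assumed field.
t_final = final_temperature_kelvin(t_initial_k=0.012, b_initial_gauss=30.0e3)
```

With these inputs the estimate falls in the microkelvin range, the same order as the lowest temperature quoted for the copper experiment.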

Temperatures below 1 μK have been achieved in experiments in which the
cooling load was the system of nuclear spins itself, particularly in experiments
that were combinations of cooling experiments and nuclear magnetic resonance
experiments.*


SUMMARY 


1. The two dominant principles of the production of low temperatures are the 
cooling of a gas by letting it do work against a force during an expansion 
and the isentropic demagnetization of a paramagnetic substance. 


2. Joule-Thomson cooling is an irreversible process in which work is done 
against interatomic attractive forces in a gas. It is used as the last cooling 
stage in liquefying low-boiling gases. 


3. In evaporation cooling the work is also done against the interatomic forces, 
but starting from the liquid phase rather than the gas phase. Using different 
working substances, evaporation cooling forms the basis of household 
cooling devices, automobile air conditioners, and laboratory cooling devices 
(in the range 4K down to 10mK). 


4. The helium dilution refrigerator is an evaporation cooling device in which
the gas is the virtual gas of 3He atoms dissolved in 4He.


5. Isentropic demagnetization utilizes the lowering of the temperature of a 
system of magnetic moments, when an external magnetic field is reduced in 
strength. The magnetic moments may be electronic or nuclear moments. 
By using nuclear moments, temperatures in the microkelvin range may be 
achieved.


PROBLEMS 


1. Helium as a van der Waals gas. (a) Estimate the liquefaction coefficient $\lambda$
for helium by treating it as a van der Waals gas. Select the van der Waals
coefficients a and b in such a way that for one mole 2Nb is the actual molar
volume of liquid helium and that 2a/b is the actual inversion temperature.
Use the data in Table 12.1. Approximate the denominator in (7) by setting


$$H_{out} - H_{liq} \approx \Delta H + \tfrac{5}{2} R (T_{in} - T_{liq}) , \tag{10}$$


* See, for example, M. Chapellier, M. Goldman, V. H. Chau and A. Abragam, Appl. Phys. 41,
849 (1970). 




where $\Delta H$ is the latent heat of vaporization of liquid helium. (Explain how this
approximation arises if one treats the expanded gas as an ideal gas.) The
resulting expression gives $\lambda$ as a function of the molar volumes $V_{in}$ and $V_{out}$.
Convert to pressures by approximating the V's via the ideal gas law. (b) Insert
numerical values for T = 15 K and compare with Figure 12.4.


2. Ideal Carnot liquefier. (a) Calculate the work $W_L$ that would be required to
liquefy one mole of a monatomic ideal gas if the liquefier operated reversibly.
Assume that the gas is supplied at room temperature $T_0$, and under the same
pressure $p_0$ at which the liquefied gas is removed, typically 1 atmosphere. Let
$T_b$ be the boiling temperature of the gas at this pressure, and $\Delta H$ the latent heat
of vaporization. Show that under these conditions


$$W_L = \tfrac{5}{2} R T_0 \left( \log\frac{T_0}{T_b} - \frac{T_0 - T_b}{T_0} \right) + \frac{T_0 - T_b}{T_b}\, \Delta H . \tag{11}$$


To derive (11) assume that the gas is first cooled at fixed pressure $p_0$ from $T_0$
to $T_b$, by means of a reversible refrigerator that operates between the fixed
upper temperature $T_h = T_0$ and a variable lower temperature equal to the gas
temperature. Initially $T_l = T_0$, and at the end $T_l = T_b$. After reaching $T_b$ the
refrigerator extracts the latent heat of vaporization at the fixed lower
temperature $T_b$. (b) Insert $T_0$ = 300 K and values for $T_b$ and $\Delta H$ characteristic of
helium. Re-express the result as kilowatt-hours per liter of liquid helium.
Actual helium liquefiers consume 5 to 10 kWh/liter.


3. Claude cycle helium liquefier. Consider a helium liquefier in which 1 mol s$^{-1}$
of gas enters the Linde stage at $T_{in}$ = 15 K and at a pressure $p_{in}$ = 30 atm.
(a) Calculate the rate of liquefaction, in liter hr$^{-1}$. Suppose that all the liquefied
helium is withdrawn to cool an external experimental apparatus, releasing the
boiled-off helium vapor into the atmosphere. Calculate the cooling load in
watts sufficient to evaporate the helium at the rate it is liquefied. Compare this
with the cooling load obtainable if the liquefier is operated as a closed-cycle
refrigerator by placing the apparatus into the liquid collection vessel of the
liquefier, so that the still cold boiled-off helium gas is returned through the heat
exchangers. (b) Assume that the heat exchanger between compressor and
expansion engine (Figure 12.1) is sufficiently ideal that the expanded return gas
that leaves it with pressure $p_{out}$ is at essentially the same temperature $T_c$ as the
compressed gas entering it with pressure $p_c$. Show that under ordinary liquefier
operation the expansion engine must extract the work


$$W_e = H(T_c, p_c) - H(T_{in}, p_{in}) - (1 - \lambda)\left[ H(T_c, p_{out}) - H(T_{in}, p_{out}) \right] \approx \tfrac{5}{2} \lambda R (T_c - T_{in}) , \tag{12}$$


per mole of compressed gas. Here $T_{in}$, $p_{in}$, $p_{out}$ and $\lambda$ have the same meaning
as in the Linde cycle section of this chapter. Assume the expansion engine
operates isentropically between the pressure-temperature pairs $(p_c, T_c)$ and
$(p_{in}, T_{in})$. From (12) and the given values of $(p_{in}, T_{in})$, calculate $(p_c, T_c)$. (c) Estimate
the minimum compressor power required to operate the liquefier, by assuming
that the compression is isothermal from $p_{out}$ to $p_c$ at temperature $T_c$ = 50°C.
Combine the result with the cooling loads calculated under (a) into a coefficient
of refrigerator performance, for both modes of operation. Compare with the
Carnot limit.


4. Evaporation cooling limit. Estimate the lowest temperature $T_{min}$ that can
be achieved by evaporation cooling of liquid 4He if the cooling load is 0.1 W
and the vacuum pump has a pump speed S = 10$^{2}$ liter s$^{-1}$. Assume that the
helium vapor pressure above the boiling helium is equal to the equilibrium
vapor pressure corresponding to $T_{min}$, and assume that the helium gas warms
up to room temperature and expands accordingly before it enters the pump.
Note: The molar volume of an ideal gas at room temperature and atmospheric
pressure (760 torr) is about 24 liters. Repeat the calculation for a much smaller
heat load (10$^{-3}$ W) and a faster pump (10$^{3}$ liter s$^{-1}$). Pump speed is defined in
Chapter 14.


5. Initial temperature for demagnetization cooling. Consider a paramagnetic 
salt with a Debye temperature (Chapter 4) of 100K. A magnetic field of 100kG 
or 10 tesla is available in the laboratory. Estimate the temperature to which the 
salt must be precooled by other means in order that significant magnetic cooling 
may subsequently be obtained by the isentropic demagnetization process. Take 
the magnetic moment of a paramagnetic ion to be 1 Bohr magneton. By signifi- 
cant cooling we may understand cooling to 0.1 of the initial temperature. 
Note: This chapter is written for students with a professional interest in semiconductors. We assume
familiarity with conduction and valence bands; electrons and holes; donors and acceptors. The
notation is that:


$n_e$   concentration of conduction electrons;
$n_h$   concentration of holes;
$n_i$   value of $n_e$ or $n_h$ for an intrinsic semiconductor;
$n_c$   effective quantum concentration for conduction electrons;
$n_v$   effective quantum concentration for holes.


In the semiconductor literature $n_c$ and $n_v$ are called the effective densities of states for the conduction
and valence bands. Notice that we use $\mu$ for the chemical potential or Fermi level, and we use $\tilde{\mu}$ for
carrier mobilities.




ENERGY BANDS; FERMI LEVEL; 
ELECTRONS AND HOLES 


The application of the Fermi-Dirac distribution to electrons in semiconductors 
is central to the design and operation of all semiconductor devices, and thus 
to much of modern electronics. We treat below those aspects of the physics 
of semiconductors and semiconductor devices that are parts of thermal physics. 
We assume that the reader is familiar with the basic ideas of the physics of 
electrons in crystalline solids, as treated in the texts on solid state physics 
and on semiconductor devices cited in the general references. We assume the 
concept of energy bands and of conduction by electrons and holes. Our principal 
aim is to understand the dependence of the all-important concentrations of 
conduction electrons and of holes upon the impurity concentration and the 
temperature. 


A semiconductor is a system with electron orbitals grouped into two energy
bands separated by an energy gap (Figure 13.1). The lower band is the valence
band and the upper band is the conduction band.* In a pure semiconductor at
$\tau = 0$ all valence band orbitals are occupied and all conduction band orbitals
are empty. A full band cannot carry any current, so that a pure semiconductor
at $\tau = 0$ is an insulator. Finite conductivity in a semiconductor follows either
from the presence of electrons, called conduction electrons, in the conduction
band or from unoccupied orbitals in the valence band, called holes.

Two different mechanisms give rise to conduction electrons and holes: 
Thermal excitation of electrons from the valence band to the conduction band, 
or the presence of impurities that change the balance between the number 
of orbitals in the valence band and the number of electrons available to fill them. 

We denote the energy of the top of the valence band by $\varepsilon_v$, and the energy
of the bottom of the conduction band by $\varepsilon_c$. The difference


$$\varepsilon_g = \varepsilon_c - \varepsilon_v \tag{1}$$


is the energy gap of the semiconductor. For typical semiconductors $\varepsilon_g$ is between
0.1 and 2.5 electron volts. In silicon, $\varepsilon_g \approx 1.1$ eV. Because $\tau \approx 1/40$ eV at room


* We treat both bands as single bands; for our purposes it does not matter that both may be groups 
of bands with additional gaps within each group. 
Figure 13.1 Energy band structure of a pure semiconductor or insulator.
The electron orbitals occur in bands which extend through the crystal.
At $\tau = 0$ all orbitals up to the top of the valence band are filled, and the
conduction band is empty. The energy interval between the bands is called
the energy gap.


temperature, we usually have $\varepsilon_g \gg \tau$. Substances with a gap of more than about
2.5eV are usually insulators. Table 13.1 gives the energy gaps for selected 
semiconductors, together with other properties needed later. 

Let $n_e$ denote the concentration of conduction electrons and $n_h$ the
concentration of holes. In a pure semiconductor the two will be equal:


$$n_e = n_h , \tag{2}$$


if the crystal is electrically neutral. 

Most semiconductors as used in devices have been intentionally doped with 
impurities that may become thermally ionized in the semiconductor at room 
temperature. Impurities that give an electron to the crystal (and become 
positively charged in the process) are called donors. Impurities that accept 
Table 13.1 Band structure data of some important semiconductors

Columns: energy gap at 300 K; quantum concentrations of electrons and holes
at 300 K; density-of-states effective masses, in units of the free electron
mass; dielectric constant relative to vacuum.

        ε_g (eV)   n_c (cm^-3)    n_v (cm^-3)    m_e*/m   m_h*/m   ε/ε_0
Si      1.14       2.7 × 10^19    1.1 × 10^19    1.06     0.58     11.7
Ge      0.67       1.0 × 10^19    5.2 × 10^18    0.56     0.35     15.8
GaAs    1.43       4.6 × 10^17    1.5 × 10^19    0.07     0.71     13.13
InP     1.35       4.9 × 10^17    6.9 × 10^18    0.073    0.42     12.37
InSb    0.18       4.6 × 10^16    6.2 × 10^18    0.015    0.39     17.88

an electron from the valence band (and become negatively charged in the 
process) are called acceptors. 

Let $n_d^+$ be the concentration of positively charged donors and $n_a^-$ the
concentration of negatively charged acceptors. The difference


$$\Delta n = n_d^+ - n_a^- \tag{3}$$


is called the net ionized donor concentration. The electrical neutrality condition
becomes


$$n_e - n_h = \Delta n = n_d^+ - n_a^- , \tag{4}$$


which specifies the difference between electron and hole concentrations.
The electron concentration may be calculated from the Fermi-Dirac
distribution function of Chapter 6:


$$f_e(\varepsilon) = \frac{1}{\exp[(\varepsilon - \mu)/\tau] + 1} , \tag{5}$$


where $\mu$ is the chemical potential of the electrons. The subscript e refers to
electrons. In semiconductor theory the electron chemical potential is always
called the Fermi level. Further, in semiconductor theory the character $\mu$ is
almost always reserved for the electron and hole mobilities, and the Fermi
level is designated by $\varepsilon_F$ or by $\zeta$. To avoid confusion with the Fermi energy
of a metal which we designated as $\varepsilon_F$ and which stands for the Fermi level
in the limit $\tau \to 0$, we shall maintain our previous usage of the letter $\mu$ for the
chemical potential at any temperature.
Given $\mu$ and $\tau$, the number of conduction electrons is obtained by summing
the distribution function $f_e(\varepsilon)$ over all conduction band orbitals:


$$N_e = \sum_{CB} f_e(\varepsilon) . \tag{6}$$

The number of holes is

$$N_h = \sum_{VB} [1 - f_e(\varepsilon)] = \sum_{VB} f_h(\varepsilon) , \tag{7}$$


where the summation is over all valence band orbitals. Here we have introduced
the quantity


$$f_h(\varepsilon) \equiv 1 - f_e(\varepsilon) = \frac{1}{\exp[(\mu - \varepsilon)/\tau] + 1} , \tag{8}$$


which is the probability that an orbital at energy $\varepsilon$ is unoccupied. We say
that the unoccupied orbital is "occupied by a hole"; then $f_h(\varepsilon)$ is the distribution
function for holes just as $f_e(\varepsilon)$ is the distribution function for electrons.
Comparison of (8) with (5) shows that the hole occupation probability involves
$\mu - \varepsilon$ where the electron occupation probability involves $\varepsilon - \mu$.

The concentrations $n_e = N_e/V$ and $n_h = N_h/V$ depend on the Fermi level.
But what is the value of the Fermi level? It is determined by the electrical
neutrality requirement (4), now written as $n_e(\mu) - n_h(\mu) = \Delta n$. This is an
implicit equation for $\mu$; to solve the equation we must determine the functional
dependences $n_e(\mu)$ and $n_h(\mu)$.
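
The relation between the electron and hole distribution functions in (5) and (8) can be illustrated with a short numerical sketch. The function names and the sample energies below are assumptions made for the example; energies and $\tau$ are in electron volts.

```python
import math


def f_electron(eps, mu, tau):
    """Fermi-Dirac occupation probability of an orbital at energy eps, eq. (5)."""
    return 1.0 / (math.exp((eps - mu) / tau) + 1.0)


def f_hole(eps, mu, tau):
    """Probability that the orbital at energy eps is unoccupied, eq. (8)."""
    return 1.0 / (math.exp((mu - eps) / tau) + 1.0)


# Illustrative values in eV (assumed): Fermi level in mid-gap, room temperature.
tau = 0.025
mu = 0.55
eps_conduction = 1.10   # an orbital at a conduction band edge
eps_valence = 0.00      # an orbital at a valence band edge

occupied_cb = f_electron(eps_conduction, mu, tau)
empty_vb = f_hole(eps_valence, mu, tau)
check_sum = f_electron(0.60, mu, tau) + f_hole(0.60, mu, tau)
```

For a Fermi level many $\tau$ away from both band edges, both `occupied_cb` and `empty_vb` are very small, which is the situation treated in the classical regime.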


Classical Regime 


We assume that both electron and hole concentrations are in the classical
regime defined by the requirements that $f_e \ll 1$ and $f_h \ll 1$, as in Chapter 6.
This will be true if, as in Figure 13.2, the Fermi level lies inside the energy gap
and is separated from both band edges by energies large enough that


$$\exp[-(\varepsilon_c - \mu)/\tau] \ll 1 ; \qquad \exp[-(\mu - \varepsilon_v)/\tau] \ll 1 . \tag{9}$$


To satisfy (9) both $(\varepsilon_c - \mu)$ and $(\mu - \varepsilon_v)$ have to be positive and at least a
few times larger than $\tau$. Such a semiconductor is called nondegenerate. The
inequalities (9) place upper limits on the electron and hole concentrations and
are satisfied in many applications. With (9) the two occupation probabilities
$f_e(\varepsilon)$ and $f_h(\varepsilon)$ reduce to classical distributions:


$$f_e(\varepsilon) \simeq \exp[-(\varepsilon - \mu)/\tau] ; \qquad f_h(\varepsilon) \simeq \exp[-(\mu - \varepsilon)/\tau] . \tag{10}$$


[Figure 13.2: band diagram annotated with $f_e(\varepsilon) \ll 1$ in the conduction band and
$1 - f_e(\varepsilon) \ll 1$ in the valence band.]

Figure 13.2 Occupancy of orbitals at a finite temperature, according to the
Fermi-Dirac distribution function. The conduction and valence bands may be represented
in terms of temperature-dependent effective numbers $N_c$, $N_v$ of degenerate orbitals
located at the two band edges $\varepsilon_c$, $\varepsilon_v$. The $n_c$, $n_v$ are the corresponding quantum
concentrations.


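We use (6) and (10) to write the total number of conduction electrons in the
form


$$N_e = \sum_{CB} \exp[-(\varepsilon - \mu)/\tau] = \exp[-(\varepsilon_c - \mu)/\tau] \sum_{CB} \exp[-(\varepsilon - \varepsilon_c)/\tau] ,$$


or


$$N_e = N_c \exp[-(\varepsilon_c - \mu)/\tau] , \tag{11}$$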
where we define

$$N_c \equiv \sum_{CB} \exp[-(\varepsilon - \varepsilon_c)/\tau] . \tag{12}$$


Here $\varepsilon - \varepsilon_c$ is the energy of a conduction electron referred to the conduction
band edge $\varepsilon_c$ as origin.

The expression for $N_c$ has the mathematical form of a partition function
for one electron in the conduction band. In Chapter 3 we evaluated a similar
sum denoted there by $Z_1$, and we can adapt that result to the present problem
with an approximate modification for band structure effects. Because of the
rapid decrease of $\exp[-(\varepsilon - \varepsilon_c)/\tau]$ as $\varepsilon$ increases above its minimum value
at $\varepsilon_c$, only the distribution of orbitals within a range of a few $\tau$ above $\varepsilon_c$ really
matters in the evaluation of the sum in (12). The orbitals high in the band
make a negligible contribution. The important point is that near the band edge
the electrons behave very much like free particles. Not only are the electrons
mobile, which causes the conductivity of the semiconductor, but the energy
distribution of the orbitals near the band edge usually differs from that of free
particles only by a proportionality factor in the energy and eventually in the
sum for $Z_1$.

We can arrange for a suitable proportionality factor by use of a device
called the density-of-states effective mass. For free particles we calculated the
partition function $Z_1$ in (3.62), but for zero spin. For particles of spin 1/2 the
result is larger by a factor of 2, so that (12) becomes


$$N_c = Z_1 = 2 n_Q V = 2 (m\tau / 2\pi\hbar^2)^{3/2} V . \tag{13}$$

Numerically, this gives

$$N_c/V = 2.509 \times 10^{19} \times (T/300\,\mathrm{K})^{3/2}\ \mathrm{cm}^{-3} , \tag{14}$$


where T is in kelvin.
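
The numerical prefactor in (14) can be reproduced directly from (13). The sketch below is a check under stated assumptions: it uses the free electron mass and CODATA values of the constants entered by hand, with $\tau = k_B T$; the function name is hypothetical.

```python
import math

# Physical constants in SI units (CODATA values, entered by hand).
HBAR = 1.054571817e-34      # J s
K_B = 1.380649e-23          # J / K
M_ELECTRON = 9.1093837e-31  # kg


def free_electron_nc_per_cm3(temperature_k):
    """N_c / V of eq. (13) for the free electron mass, returned in cm^-3."""
    tau = K_B * temperature_k                      # fundamental temperature in joules
    per_m3 = 2.0 * (M_ELECTRON * tau / (2.0 * math.pi * HBAR ** 2)) ** 1.5
    return per_m3 * 1.0e-6                         # convert m^-3 to cm^-3


nc_300 = free_electron_nc_per_cm3(300.0)
ratio = free_electron_nc_per_cm3(600.0) / nc_300   # should follow the T^(3/2) law
```

Evaluating `nc_300` gives a value that agrees with the prefactor of (14) to within the rounding of the constants, and doubling the temperature multiplies the result by $2^{3/2}$.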

The quantity $N_c$ for actual semiconductors exhibits the same temperature
dependence as (13), but differs in magnitude by a proportionality factor. We
express this formally by writing, in analogy to (13),


$$N_c = 2 (m_e^* \tau / 2\pi\hbar^2)^{3/2} V , \tag{15}$$


where $m_e^*$ is called the density-of-states effective mass for electrons.
Experimental values are given in Table 13.1. The introduction of effective masses
is more than a formality. In the theory of electrons in crystals it is shown that 




the dynamical behavior of electrons and holes, under the influence of external 
forces such as electric fields, is that of particles with effective masses different 
from the free electron mass. The dynamical masses usually are different from 
the density-of-states masses, however. 

We define the quantum concentration $n_c$ for conduction electrons as


$$n_c \equiv N_c/V = 2 (m_e^* \tau / 2\pi\hbar^2)^{3/2} . \tag{16}$$


By (11) the conduction electron concentration $n_e = N_e/V$ becomes


$$n_e = n_c \exp[-(\varepsilon_c - \mu)/\tau] . \tag{17}$$


The earlier assumption (9) is equivalent to the assumption that $n_e \ll n_c$, so
that the conduction electrons act as an ideal gas. As an aid to memory, we may
think of $N_e$ as arising from $N_c$ orbitals at $\varepsilon_c$, with the Fermi level at $\mu$. Warning:
In the semiconductor literature $n_c$ is invariably called the effective density of
states of the conduction band.

Similar reasoning gives the number of holes in the valence band:


$$N_h = \sum_{VB} \exp[-(\mu - \varepsilon)/\tau] = N_v \exp[-(\mu - \varepsilon_v)/\tau] , \tag{18}$$

with the definition


$$N_v \equiv \sum_{VB} \exp[-(\varepsilon_v - \varepsilon)/\tau] . \tag{19}$$


We define the quantum concentration $n_v$ for holes as


$$n_v \equiv N_v/V = 2 (m_h^* \tau / 2\pi\hbar^2)^{3/2} , \tag{20}$$


where $m_h^*$ is the density-of-states effective mass for holes. By (18) the hole
concentration $n_h = N_h/V$ is


$$n_h = n_v \exp[-(\mu - \varepsilon_v)/\tau] . \tag{21}$$
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Like (17), this gives the carrier concentration in terms of the quantum con- 
centration and the position of the Fermi level relative to the valence band 
edge. In the semiconductor literature n, is called the effective density of states 
of the valence band. 


Law of Mass Action 
The product $n_en_h$ is independent of the Fermi level so long as the concentrations 
are in the classical regime. Then 

$$n_en_h = n_cn_v\exp[-(\varepsilon_c - \varepsilon_v)/\tau] = n_cn_v\exp(-\varepsilon_g/\tau), \tag{22a}$$

where the energy gap $\varepsilon_g \equiv \varepsilon_c - \varepsilon_v$. In a pure semiconductor we have $n_e = n_h$, 
and the common value of the two concentrations is called the intrinsic carrier 
concentration $n_i$ of the semiconductor. By (22a), 

$$n_i = (n_cn_v)^{1/2}\exp(-\varepsilon_g/2\tau). \tag{22b}$$

The Fermi level independence of the product $n_en_h$ means that this product 
retains its value even when $n_e \neq n_h$, as in the presence of electrically charged 
impurity atoms, provided both concentrations remain in the classical regime. 
We may then write (22a) as 

$$n_en_h = n_i^2. \tag{22c}$$


The value of the product depends only on the temperature. This result is the 
mass action law of semiconductors, similar to the chemical mass action law 
(Chapter 9). 
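
The independence of the product from the Fermi level can be checked numerically. The sketch below evaluates (17) and (21) at several trial Fermi levels and compares the product with $n_i^2$ from (22b); all parameter values are made up for illustration (energies in units of $\tau$, concentrations in arbitrary units) and are not entries of Table 13.1.

```python
import math


def electron_concentration(mu, eps_c, n_c, tau):
    """Classical-regime conduction electron concentration, eq. (17)."""
    return n_c * math.exp(-(eps_c - mu) / tau)


def hole_concentration(mu, eps_v, n_v, tau):
    """Classical-regime hole concentration, eq. (21)."""
    return n_v * math.exp(-(mu - eps_v) / tau)


# Illustrative, made-up parameters in units where tau = 1.
tau, eps_v, eps_c = 1.0, 0.0, 40.0
n_c, n_v = 1.0, 0.5

n_i = math.sqrt(n_c * n_v) * math.exp(-(eps_c - eps_v) / (2.0 * tau))   # eq. (22b)

# The product n_e * n_h should equal n_i^2 wherever the Fermi level sits, eq. (22c).
products = [
    electron_concentration(mu, eps_c, n_c, tau) * hole_concentration(mu, eps_v, n_v, tau)
    for mu in (10.0, 20.0, 30.0)
]
for p in products:
    print(p / n_i**2)
```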


Intrinsic Fermi Level 


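For an intrinsic semiconductor $n_e = n_i$, and we may equate the right-hand 
sides of (17) and (22b): 

$$n_c\exp[-(\varepsilon_c - \mu)/\tau] = (n_cn_v)^{1/2}\exp(-\varepsilon_g/2\tau). \tag{23}$$

Insert $\varepsilon_g = \varepsilon_c - \varepsilon_v$ and divide by $n_c\exp(-\varepsilon_c/\tau)$: 

$$\exp(\mu/\tau) = (n_v/n_c)^{1/2}\exp[(\varepsilon_c + \varepsilon_v)/2\tau].$$

We take logarithms to obtain 

$$\mu = \tfrac{1}{2}(\varepsilon_c + \varepsilon_v) + \tfrac{1}{2}\tau\log(n_v/n_c) = \tfrac{1}{2}(\varepsilon_c + \varepsilon_v) + \tfrac{3}{4}\tau\log(m_h^*/m_e^*), \tag{24}$$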


by use of (16) and (20). The Fermi level for an intrinsic semiconductor lies near 
the middle of the forbidden gap, but displaced from the exact middle by an 
amount that is usually small. 
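
To get a feeling for the size of the displacement in (24), the following sketch evaluates the intrinsic Fermi level and confirms that it makes $n_e = n_h$. The gap, the temperature, and the mass ratio are hypothetical numbers chosen for illustration, not values from the tables.

```python
import math


def intrinsic_fermi_level(eps_c, eps_v, tau, mass_ratio_h_over_e):
    """Eq. (24): mu = (eps_c + eps_v)/2 + (3/4) tau log(m_h*/m_e*)."""
    return 0.5 * (eps_c + eps_v) + 0.75 * tau * math.log(mass_ratio_h_over_e)


# Hypothetical inputs: a 1.1 eV gap, tau = 0.0259 eV, m_h*/m_e* = 0.5.
eps_v, eps_c, tau, ratio = 0.0, 1.1, 0.0259, 0.5
mu_equal = intrinsic_fermi_level(eps_c, eps_v, tau, 1.0)
mu_light_holes = intrinsic_fermi_level(eps_c, eps_v, tau, ratio)
shift = mu_light_holes - mu_equal

# Consistency check with (17) and (21); by (16) and (20), n_v/n_c = ratio^(3/2).
n_c, n_v = 1.0, ratio**1.5
n_e = n_c * math.exp(-(eps_c - mu_light_holes) / tau)
n_h = n_v * math.exp(-(mu_light_holes - eps_v) / tau)

print(f"midgap: {mu_equal:.4f} eV, shift: {shift * 1e3:.2f} meV, n_e/n_h = {n_e / n_h:.6f}")
```

With these inputs the shift is of the order of $\tau$ itself, small compared with the gap, in line with the statement above.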


n-TYPE AND p-TYPE SEMICONDUCTORS 


Donors and Acceptors 


Pure semiconductors are an idealization of little practical interest. Semicon- 
ductors used in devices usually have impurities intentionally added in order to 
increase the concentration of either conduction electrons or holes. A semicon- 
ductor with more conduction electrons than holes is called n-type; a semi- 
conductor with more holes than electrons is called p-type. The letters n and p 
signify negative and positive majority carriers. Consider a silicon crystal in 
which some of the Si atoms have been substituted by phosphorus atoms. 
Phosphorus is just to the right of Si in the periodic table, hence each P has 
exactly one electron more than the Si it replaces. These extra electrons do not 
fit into the filled valence band; hence a Si crystal with some P atoms will contain 
more conduction electrons and, by the law of mass action, fewer holes than a 
pure Si crystal. Next consider aluminum atoms. Aluminum is just to the left 
of Si in the periodic table, hence Al has exactly one electron fewer than the Si 
it replaces. As a result, Al atoms increase the number of holes and decrease the 
number of conduction electrons. 

Most impurities in the same columns of the periodic table as P and Al will 
behave in Si just as P and Al behave. What matters is the number of valence 
electrons relative to Si and not the total number of electrons on the atom. 
Impurities from other columns of the periodic table will not behave so simply. 
Similar reasoning can be applied to other semiconductors, for example GaAs. 
For the present we assume that each donor atom contributes one electron which 
may enter the conduction band or fill one hole in the valence band. We also 
assume that each acceptor atom removes one electron, either from the valence 
band or from the conduction band. These assumptions are called the approximation 
of fully ionized impurities: all impurities when ionized are either positively 
charged donors $D^+$ or negatively charged acceptors $A^-$. 

The electrical neutrality condition (4) told us that 

$$\Delta n \equiv n_e - n_h = n_d^+ - n_a^-. \tag{25}$$

Because $n_h = n_i^2/n_e$ from the mass action law, we see that (25) leads to a quadratic 
equation for $n_e$: 

$$n_e^2 - n_e\Delta n = n_i^2. \tag{26}$$

The positive root is 

$$n_e = \tfrac{1}{2}\{[(\Delta n)^2 + 4n_i^2]^{1/2} + \Delta n\}, \tag{27a}$$

and because $n_h = n_e - \Delta n$ we have 

$$n_h = \tfrac{1}{2}\{[(\Delta n)^2 + 4n_i^2]^{1/2} - \Delta n\}. \tag{27b}$$


Most often the doping concentration is large compared to the intrinsic 
concentration, so that either $n_e$ or $n_h$ is much larger than $n_i$: 

$$|\Delta n| \gg n_i. \tag{28}$$

This condition defines an extrinsic semiconductor. The square roots in (27) 
can then be expanded: 

$$[(\Delta n)^2 + 4n_i^2]^{1/2} = |\Delta n|[1 + (2n_i/\Delta n)^2]^{1/2} \simeq |\Delta n| + 2n_i^2/|\Delta n|. \tag{29}$$

In an n-type semiconductor $\Delta n$ is positive and (27) becomes 

$$n_e \simeq \Delta n + n_i^2/\Delta n \simeq \Delta n; \qquad n_h \simeq n_i^2/\Delta n \ll n_i. \tag{30}$$

In a p-type semiconductor $\Delta n$ is negative and (27) becomes 

$$n_e \simeq n_i^2/|\Delta n| \ll n_i; \qquad n_h \simeq |\Delta n| + n_i^2/|\Delta n| \simeq |\Delta n|. \tag{31}$$


The majority carrier concentration in the extrinsic limit (28) is nearly equal to 
the magnitude of An, while the minority carrier concentration is inversely 
proportional to |An|. 
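
The roots (27a) and (27b) are easy to evaluate, with one numerical caveat: in the extrinsic limit the minority carrier root subtracts two nearly equal numbers. The sketch below, which is an illustration under assumed inputs rather than a prescribed method, takes the majority carrier from the root with the plus sign and the minority carrier from the mass action law, which is algebraically the same as (27).

```python
import math


def carrier_concentrations(delta_n, n_i):
    """Return (n_e, n_h) from (27a) and (27b) for net ionized doping delta_n = n_d+ - n_a-.

    The majority carrier is taken from the root with the plus sign; the minority
    carrier then follows from n_e * n_h = n_i^2, which avoids subtracting two
    nearly equal numbers when |delta_n| >> n_i.
    """
    root = math.sqrt(delta_n**2 + 4.0 * n_i**2)
    majority = 0.5 * (root + abs(delta_n))
    minority = n_i**2 / majority
    return (majority, minority) if delta_n >= 0 else (minority, majority)


# Illustrative inputs in cm^-3 (assumed for this sketch, not taken from the text).
n_i = 1.0e10
n_e, n_h = carrier_concentrations(1.0e16, n_i)
print(f"n-type: n_e = {n_e:.6e}, n_h = {n_h:.6e}")
p_e, p_h = carrier_concentrations(-1.0e16, n_i)
print(f"p-type: n_e = {p_e:.6e}, n_h = {p_h:.6e}")
```

For these inputs the majority concentration is essentially $|\Delta n|$ and the minority concentration is $n_i^2/|\Delta n|$, as in (30) and (31).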


Fermi Level in Extrinsic Semiconductor 


By use of the mass action law we calculated the carrier concentrations without 
having to calculate the Fermi level first. The Fermi level is obtained from $n_e$ or 
$n_h$ by solving (17) or (21) for $\mu$: 

$$\mu = \varepsilon_c - \tau\log(n_c/n_e) = \varepsilon_v + \tau\log(n_v/n_h). \tag{32}$$

We may now use (27) to find $\mu$ as a function of temperature and doping level 
$\Delta n$. Figure 13.3 gives numerical results for Si. With decreasing temperature the 
Fermi level in an extrinsic semiconductor approaches either the conduction or 
the valence band edge. 

Figure 13.3. The Fermi level in silicon as a function of temperature, for 
various doping concentrations (curves for n-type doping, for p-type doping, 
and for the intrinsic level). The Fermi levels are expressed relative to 
the band edges. A small decrease of the energy gap with temperature has 
been neglected. 


Degenerate Semiconductors 


When one of the carrier concentrations is increased and approaches the quan- 
tum concentration, we may no longer use the classical distribution (10) for that 
carrier. The calculation of the carrier concentration now follows the treatment 
of the Fermi gas in Chapter 7. The sum over all occupied orbitals, which is 
equal to the number of electrons, is written as an integral over the density of 
states times the distribution function: 


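$$N = \int d\varepsilon\,\mathcal{D}(\varepsilon)f(\varepsilon), \tag{33}$$

where for free particles of mass $m$ the density of states is 

$$\mathcal{D}(\varepsilon) = \frac{V}{2\pi^2}\left(\frac{2m}{\hbar^2}\right)^{3/2}\varepsilon^{1/2}. \tag{34}$$

That is, $\mathcal{D}(\varepsilon)d\varepsilon$ is the number of orbitals in the energy interval $(\varepsilon, \varepsilon + d\varepsilon)$. To 
make the transition to conduction electrons in semiconductors we replace $N$ 
by $n_eV$; $m$ by $m_e^*$; and $\varepsilon$ by $\varepsilon - \varepsilon_c$. We obtain 

$$n_e = \frac{1}{2\pi^2}\left(\frac{2m_e^*}{\hbar^2}\right)^{3/2}\int_{\varepsilon_c}^{\infty}\frac{d\varepsilon\,(\varepsilon - \varepsilon_c)^{1/2}}{1 + \exp[(\varepsilon - \mu)/\tau]}. \tag{35}$$

Let $x \equiv (\varepsilon - \varepsilon_c)/\tau$ and $\eta \equiv (\mu - \varepsilon_c)/\tau$. We use the definition (16) of $n_c$ to obtain 

$$n_e/n_c = I(\eta) \equiv \frac{2}{\sqrt{\pi}}\int_0^{\infty}\frac{dx\,x^{1/2}}{1 + \exp(x - \eta)}. \tag{36}$$

The integral $I(\eta)$ in (36) is known as the Fermi-Dirac integral. 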
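When $\varepsilon_c - \mu \gg \tau$ we have $-\eta \gg 1$, so that $\exp(x - \eta) \gg 1$. In this limit 

$$n_e/n_c \simeq \frac{2}{\sqrt{\pi}}e^{\eta}\int_0^{\infty}dx\,e^{-x}x^{1/2} = \frac{2}{\sqrt{\pi}}\Gamma(3/2)e^{\eta} = \exp[(\mu - \varepsilon_c)/\tau], \tag{37}$$

the familiar result for the ideal gas. 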

In semiconductors the electron concentration rarely exceeds several times 
the quantum concentration $n_c$. The deviation between the value of $\mu$ from (35) 
and the approximation (37) then can be expanded into a rapidly converging 
power series of the ratio $r \equiv n_e/n_c$, called the Joyce-Dixon approximation:* 

$$\eta - \log r \simeq \frac{1}{\sqrt{8}}\,r - \left(\frac{\sqrt{3}}{9} - \frac{3}{16}\right)r^2 + \cdots. \tag{38}$$


Figure 13.4 compares the exact relation (36) with the approximations (37) and 
(38). 


* W. B. Joyce and R. W. Dixon, Appl. Phys. Lett. 31, 354 (1977). If the right side of (38) is written 
as $\sum A_m r^m$, the first four coefficients are $A_1 = 3.53553 \times 10^{-1}$; $A_2 = -4.95009 \times 10^{-3}$; $A_3 = 
1.48386 \times 10^{-4}$; $A_4 = -4.42563 \times 10^{-6}$. 
Figure 13.4 Electron concentration as a function of Fermi level, for $\mu$ near or 
above conduction band edge $\varepsilon_c$. The dashed curve represents the first term of the 
Joyce-Dixon approximation (38). 
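
The comparison among (36), (37), and (38) can be reproduced with a few lines of numerical work. The sketch below is one possible approach, not the method used to draw the figure: it evaluates the Fermi-Dirac integral by Simpson's rule after the substitution $x = t^2$ (our choice, made to remove the square-root behavior at the lower limit) and then feeds the result into the four-term Joyce-Dixon series with the coefficients quoted in the footnote. The cutoff of the integration range and the number of steps are assumptions.

```python
import math

# Coefficients A_1..A_4 quoted in the footnote to (38).
JOYCE_DIXON = (3.53553e-1, -4.95009e-3, 1.48386e-4, -4.42563e-6)


def fermi_dirac_ratio(eta, steps=4000):
    """Evaluate I(eta) = n_e/n_c of (36) by Simpson's rule.

    With x = t^2 the integral becomes
    I = (2/sqrt(pi)) * integral of 2 t^2 / (1 + exp(t^2 - eta)) dt from 0 to t_max.
    """
    t_max = math.sqrt(max(eta, 0.0) + 40.0)   # integrand is negligible beyond this
    h = t_max / steps                         # steps must be even for Simpson's rule

    def g(t):
        return 2.0 * t * t / (1.0 + math.exp(t * t - eta))

    total = g(0.0) + g(t_max)
    for k in range(1, steps):
        total += (4.0 if k % 2 else 2.0) * g(k * h)
    return (2.0 / math.sqrt(math.pi)) * total * h / 3.0


def joyce_dixon_eta(r):
    """Approximate eta = (mu - eps_c)/tau from r = n_e/n_c using four terms of (38)."""
    return math.log(r) + sum(a * r ** (m + 1) for m, a in enumerate(JOYCE_DIXON))


for eta in (-5.0, 0.0, 2.0):
    r = fermi_dirac_ratio(eta)
    print(f"eta = {eta:+.1f}: I = {r:.5f}, exp(eta) = {math.exp(eta):.5f}, "
          f"Joyce-Dixon eta = {joyce_dixon_eta(r):+.4f}")
```

Well below the band edge the integral approaches $e^{\eta}$, the ideal gas result (37); near and above the band edge the series recovers $\eta$ from $r$ closely while (37) alone does not.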
When $n_e$ is no longer small compared to $n_c$, the expression of the mass action 
law must be modified. In Problem 4 we ask the reader to show that 

$$n_en_h = n_i^2\exp[-n_e/\sqrt{8}\,n_c + \cdots]. \tag{39}$$

If the gap itself depends on the carrier concentrations, the value of $n_i$ to be used 
here will depend on concentration. 


Impurity Levels 


The addition of impurities to a semiconductor moves some orbitals from the 
conduction or valence band into the energy gap, where the orbitals now appear 
as localized bound states. We consider phosphorus in a silicon crystal. If the 
P atom has released its extra electron to the Si conduction band, the atom 
appears as a positively charged ion. The positive ion attracts the electrons in 
the conduction band, and the ion can bind an electron just as a proton can 
bind an electron in a hydrogen atom. However, the binding energy in the 
semiconductor is several orders of magnitude lower, mostly because the binding 
energy is to be divided by the square of the static dielectric constant, and 
partly because of mass effects. Table 13.2 gives the ionization energies for 
column V donors in Si and Ge. The lowest orbital of an electron bound to a 
donor corresponds to an energy level $\Delta\varepsilon_d = \varepsilon_c - \varepsilon_d$ below the edge of the 
conduction band (Figure 13.5). There is one set of bound orbitals for every 
donor. 

A parallel argument applies to holes and acceptors. Orbitals are split off 
from the valence band, as in Figure 13.5. For each acceptor atom there is one 
set of bound orbitals with an ionization energy $\Delta\varepsilon_a = \varepsilon_a - \varepsilon_v$, of the same order 
as $\Delta\varepsilon_d$. Ionization energies for column III acceptors in Si are listed in Table 13.2. 

In GaAs the ionization energies for all column VI donors except oxygen are 
close to 6 meV. For zinc, the most important acceptor, $\Delta\varepsilon_a = 24$ meV. Some 
impurities generate orbitals deep inside the forbidden gap, sometimes with 
multiple orbitals corresponding to different ionization states. 


Table 13.2 Ionization energies of column V donors and 
column III acceptors in Si and Ge, in meV 

            Donors                  Acceptors 
        P       As      Sb      B       Al      Ga      In 
Si      45      49      39      45      37      65      16 
Ge      12.0    12.7    9.6     10.4    10.2    10.8    11.2 


Figure 13.5 Donor and acceptor impurity levels in the energy 
gap of a semiconductor. 


Occupation of Donor Levels 


A donor level can be occupied by an electron with either spin up or spin down. 
Hence there are two different orbitals with the same energy. However, the 
occupations of these two orbitals are not independent of each other: Once the 
level is occupied by one electron, the donor cannot bind a second electron with 
opposite spin. As a result, the occupation probability for a donor level is not 
given by the simple Fermi-Dirac distribution function, but by a function 
treated in Chapter 5. We write the probability that the donor orbital is vacant, 
so that the donor is ionized, in a form slightly different from (5.73): 

$$f(D^+) = \frac{1}{1 + 2\exp[(\mu - \varepsilon_d)/\tau]}. \tag{40}$$

Here $\varepsilon_d$ is the energy of a singly occupied donor orbital relative to the origin 
of the energy. The probability that the donor orbital is occupied by an electron, 
so that the donor is neutral, is given by (5.74): 

$$f(D) = \frac{1}{1 + \tfrac{1}{2}\exp[(\varepsilon_d - \mu)/\tau]}. \tag{41}$$
Acceptors require extra thought. In the ionized condition $A^-$ of the acceptor, 
each of the chemical bonds between the acceptor atom and the surrounding 
semiconductor atoms contains a pair of electrons with antiparallel spins. There 
is only one such state, hence the ionized condition contributes only one term, 
$\exp[(\mu - \varepsilon_a)/\tau]$, to the Gibbs sum for the acceptor. In the neutral condition $A$ 
of the acceptor, one electron is missing from the surrounding bonds. Because 
the missing electron may have either spin up or spin down, the neutral condition 
is represented twice in the Gibbs sum for the acceptor, by a term $2 \times 1 = 2$. 

Hence the thermal average occupancy is 

$$f(A^-) = \frac{\exp[(\mu - \varepsilon_a)/\tau]}{2 + \exp[(\mu - \varepsilon_a)/\tau]} = \frac{1}{1 + 2\exp[(\varepsilon_a - \mu)/\tau]}. \tag{42}$$

The neutral condition $A$, with the acceptor orbital unoccupied, occurs with 
probability 

$$f(A) = \frac{2}{2 + \exp[(\mu - \varepsilon_a)/\tau]} = \frac{1}{1 + \tfrac{1}{2}\exp[(\mu - \varepsilon_a)/\tau]}. \tag{43}$$


The value of $\Delta n = n_d^+ - n_a^-$ is the difference of concentrations of $D^+$ and $A^-$. 
From (40) or (42) we have 

$$n_d^+ = n_df(D^+) = \frac{n_d}{1 + 2\exp[(\mu - \varepsilon_d)/\tau]}; \tag{44}$$

$$n_a^- = n_af(A^-) = \frac{n_a}{1 + 2\exp[(\varepsilon_a - \mu)/\tau]}. \tag{45}$$

The neutrality condition (4) may be rewritten as 

$$n^- \equiv n_e + n_a^- = n_h + n_d^+ \equiv n^+. \tag{46}$$

This expression may be visualized by a logarithmic plot of $n^-$ and $n^+$ as functions 
of the position of the Fermi level (Figure 13.6). The four dashed lines 
represent the four terms in (46); the two solid lines represent the sum of all 
positive and all negative charges. The actual Fermi level occurs where the total 
positive charges equal the total negative charges. 
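
The graphical construction has a direct numerical counterpart: because $n^-$ increases and $n^+$ decreases as the Fermi level is raised, the neutrality point can be located by bisection. The sketch below is a minimal illustration under assumed, made-up parameters (energies in units of $\tau$, concentrations in units of $n_c$); it uses the classical expressions (17) and (21) and therefore applies only to nondegenerate doping. The dictionary keys and function names are ours.

```python
import math


def net_negative_charge(mu, p):
    """Return n^- - n^+ of (46) at Fermi level mu, using (17), (21), (44), (45)."""
    tau = p["tau"]
    n_e = p["n_c"] * math.exp(-(p["eps_c"] - mu) / tau)
    n_h = p["n_v"] * math.exp(-(mu - p["eps_v"]) / tau)
    n_d_plus = p["n_d"] / (1.0 + 2.0 * math.exp((mu - p["eps_d"]) / tau))
    n_a_minus = p["n_a"] / (1.0 + 2.0 * math.exp((p["eps_a"] - mu) / tau))
    return (n_e + n_a_minus) - (n_h + n_d_plus)


def solve_fermi_level(p, iterations=200):
    """Bisect between the band edges; n^- - n^+ increases monotonically with mu."""
    lo, hi = p["eps_v"], p["eps_c"]
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if net_negative_charge(mid, p) > 0.0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)


# Made-up parameters: donors 1.74 tau below the conduction band edge, no acceptors.
params = {"tau": 1.0, "eps_v": 0.0, "eps_c": 40.0, "n_c": 1.0, "n_v": 1.0,
          "eps_d": 40.0 - 1.74, "eps_a": 2.0, "n_d": 0.01, "n_a": 0.0}
mu = solve_fermi_level(params)
n_e = params["n_c"] * math.exp(-(params["eps_c"] - mu) / params["tau"])
print(f"mu - eps_c = {mu - params['eps_c']:.4f} tau, n_e = {n_e:.6f} n_c")

undoped = dict(params, n_d=0.0)
mu_intrinsic = solve_fermi_level(undoped)
print(f"undoped: mu = {mu_intrinsic:.4f} (midgap is 20)")
```

With equal $n_c$ and $n_v$ the undoped case returns the middle of the gap, as (24) requires.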

For $n_d^+ - n_a^- \gg n_i$, as in Figure 13.6, the holes can be neglected; for 
$n_a^- - n_d^+ \gg n_i$ the electrons can be neglected. If one of the two impurity 
species can be neglected, the majority carrier concentration can be calculated 
in closed form. Consider an n-type semiconductor with no acceptors. The 
neutrality point in Figure 13.6 is now given by the intersection point of the $n^+$ 
curve with the $n_e$ curve. If the donor concentration is not too high, the 
intersection will be on the straight portion of the $n_e$ curve, along which the 
approximation (17) holds. We rewrite this as 

$$\exp(\mu/\tau) = (n_e/n_c)\exp(\varepsilon_c/\tau); \tag{47}$$

$$\exp[(\mu - \varepsilon_d)/\tau] = (n_e/n_c)\exp[(\varepsilon_c - \varepsilon_d)/\tau] = n_e/n_e^*, \tag{48}$$

where 

$$n_e^* \equiv n_c\exp[-(\varepsilon_c - \varepsilon_d)/\tau] = n_c\exp(-\Delta\varepsilon_d/\tau) \tag{49}$$

is the electron concentration that would be present in the conduction band if 
the Fermi level coincided with the donor level. Here $\Delta\varepsilon_d = \varepsilon_c - \varepsilon_d$ is the donor 
ionization energy. 

Figure 13.6 Graphical determination of Fermi level and electron concentration in 
an n-type semiconductor containing both donors and acceptors. 

We insert (48) into (44) and set $n_e = n_d^+$ to obtain 

$$n_e = \frac{n_d}{1 + 2n_e/n_e^*}; \tag{50}$$

$$n_e^2 + \tfrac{1}{2}n_en_e^* = \tfrac{1}{2}n_dn_e^*. \tag{51}$$

This is a quadratic equation in $n_e$; the positive solution is 

$$n_e = \tfrac{1}{4}n_e^*\{[1 + (8n_d/n_e^*)]^{1/2} - 1\}. \tag{52}$$

For shallow donor levels, $n_e^*$ is large and close to $n_c$. If the doping is sufficiently 
weak that $8n_d \ll n_e^*$, the square root may be expanded by use of 

$$(1 + x)^{1/2} \simeq 1 + \tfrac{1}{2}x - \tfrac{1}{8}x^2 + \cdots, \tag{53}$$

for $x \ll 1$. With $x = 8n_d/n_e^*$ we obtain 

$$n_e \simeq n_d - 2n_d^2/n_e^* = n_d(1 - 2n_d/n_e^*). \tag{54}$$

The second term in the parentheses gives the first order departure from complete 
ionization. For example, for P in Si at 300 K, we have $\Delta\varepsilon_d \simeq 1.74\tau$ from 
Table 13.2, so that $n_e^* \simeq 0.175n_c$ from (49). If $n_d = 0.01n_c$, Eq. (54) predicts 
that 11.4 pct of the donors remain un-ionized. The limit of weak ionization is 
the subject of Problem 6. 
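
The numbers in this example can be reproduced directly, and the first-order estimate (54) can be set beside the full root (52). The sketch below works in units of $n_c$ with the inputs of the example; the function names are illustrative only.

```python
import math


def electron_concentration_exact(n_d, n_star):
    """Positive root (52) of the quadratic (51); n_star is n_e* of (49)."""
    return 0.25 * n_star * (math.sqrt(1.0 + 8.0 * n_d / n_star) - 1.0)


def electron_concentration_weak_doping(n_d, n_star):
    """First-order expansion (54), valid for 8 n_d << n_e*."""
    return n_d * (1.0 - 2.0 * n_d / n_star)


# Concentrations in units of n_c; inputs are those of the P-in-Si example.
n_star = math.exp(-1.74)     # (49) with donor ionization energy 1.74 tau
n_d = 0.01

neutral_first_order = 1.0 - electron_concentration_weak_doping(n_d, n_star) / n_d
neutral_exact = 1.0 - electron_concentration_exact(n_d, n_star) / n_d
print(f"n_e*/n_c = {n_star:.4f}")
print(f"un-ionized fraction, first order (54): {100 * neutral_first_order:.1f} pct")
print(f"un-ionized fraction, full root (52):   {100 * neutral_exact:.1f} pct")
```

Because $8n_d/n_e^*$ is about 0.46 here, not very small, the full root gives a somewhat smaller un-ionized fraction than the first-order estimate.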




Example: Semi-insulating gallium arsenide. Could pure GaAs be prepared, it would have 
an intrinsic carrier concentration at room temperature of $n_i < 10^7\ \mathrm{cm^{-3}}$. With such a low 
concentration of carriers, 10~'* less than a metal, the conductivity would be closer to an 
insulator than to a conventional semiconductor. Intrinsic GaAs would be useful as an 
insulating substrate on which to prepare thin layers of doped GaAs as needed for devices. 
There does not exist a technology to purify any substance to $10^7$ impurities per cm$^3$. 




However, it is possible to achieve near intrinsic carrier concentrations in GaAs by doping 
with high concentrations (10'S—10'7 cm~*) of oxygen and chromium together, two impu- 
rities that have their impurity levels near the middle of the energy gap. Oxygen enters an 
As site and is a donor in GaAs, as expected from the position of O in the periodic table 
relative to As; the energy level* is about 0.7 eV below $\varepsilon_c$. Chromium is an acceptor with an 
energy level about 0.84 eV below $\varepsilon_c$. 

Consider a GaAs crystal doped with both oxygen and chromium. The ratio of the two 
concentrations is not critical; anything with an O:Cr ratio between about 1:10 and 10:1 
will do. If the concentrations of all other impurities are small compared with those of O and 
Cr, the position of the Fermi level will be governed by the equilibrium between electrons 
on O and holes on Cr. The construction of Figure 13.6 applied to this system shows that 
over the indicated concentration ratio range the Fermi level is pinned to a range between 
$1.5\tau$ above the O level and $1.5\tau$ below the Cr level. With the Fermi level pinned near the 
middle of the energy gap, the crystal must act as nearly intrinsic. 

Gallium arsenide doped in this way is called semi-insulating GaAs and is used extensively 
as a high-resistivity (10° to 10'° Q cm) substrate for GaAs devices. A similar doping pro- 
cedure is possible in InP, with iron taking the place of chromium. 


p-n JUNCTIONS 


Semiconductors used in devices are almost never uniformly doped. An under- 
standing of devices requires an understanding of nonuniformly doped semi- 
conductors, particularly of structures called p—n junctions in which the doping 
changes with position from p-type to n-type within the same crystal. We consider 
a semiconductor crystal inside which the doping changes abruptly at $x = 0$ 
from a uniform donor concentration $n_d$ to a uniform acceptor concentration 
$n_a$, as in Figure 13.7a. This is an example of a p-n junction. More complicated 
device structures are made up from simple junctions: a bipolar transistor has 
two closely spaced p-n junctions, of the sequence p-n-p or n-p-n. 

p-n junctions contain a built-in electrostatic potential step $V_{bi}$, even in the 
absence of an externally applied voltage (Figure 13.7b). With no externally 
applied voltage, the electrons on the two sides of the junction are in diffusive 
equilibrium, which means that the chemical potentials (Fermi levels) of the two 
sides are the same. Because the position of the Fermi level within the band 
structure depends on the local doping, constancy of the Fermi level forces a 
shift in the electron energy bands in crossing the junction (Figure 13.7c). The 
shift is $eV_{bi}$. The potential step of height $eV_{bi}$ is an example of the potential step 
required to equalize the total chemical potential of two systems when the 
intrinsic chemical potentials are unequal, as discussed in Chapter 5. 


* R. Zucca, J. Appl. Phys. 48, 1987 (1977). The energy assignment is somewhat uncertain. 
Figure 13.7 A p-n junction. (a) Doping distribution. It is assumed 
that the doping changes abruptly from n-type to p-type. The two 
doping levels are usually different. (b) Electrostatic potential. The 
built-in voltage $V_{bi}$ establishes diffusive equilibrium between the 
two sides with different electron concentrations as well as hole 
concentrations. (c) Energy bands. Because the Fermi level must be 
constant throughout the structure, the bands on the two sides are 
shifted relative to each other. (d) Space charge dipole required to 
generate the built-in voltage and to shift the energy bands. 




We assume that the two doping concentrations $n_a$, $n_d$ lie in the extrinsic but 
nondegenerate range, as defined by 

$$n_i \ll n_d \ll n_c; \qquad n_i \ll n_a \ll n_v. \tag{55}$$

If the donors are fully ionized on the n side and the acceptors fully ionized on 
the p side, then the electron and hole concentrations satisfy 

$$n_e \simeq n_d; \qquad n_h \simeq n_a, \tag{56}$$

one on the n side and the other on the p side. (We have dropped the superscripts 
$\pm$ from $n_d^+$, $n_a^-$.) The conduction band energies on the n and p sides follow from 
(17): 

$$\varepsilon_{cn} = \mu - \tau\log(n_d/n_c); \tag{57}$$

$$\varepsilon_{cp} = \mu - \tau\log(n_{ep}/n_c) = \mu - \tau\log(n_i^2/n_an_c), \tag{58}$$

by (22c). Hence 

$$eV_{bi} = \varepsilon_{cp} - \varepsilon_{cn} = \tau\log(n_an_d/n_i^2), \tag{59}$$

or 

$$eV_{bi} = \varepsilon_g - \tau\log(n_cn_v/n_dn_a). \tag{60}$$

For doping concentrations $n_a \simeq 0.01n_v$ and $n_d \simeq 0.01n_c$, we find $eV_{bi} \simeq \varepsilon_g - 9.2\tau$, 
which is 0.91 eV in silicon at room temperature. 
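
The factor 9.2 is just $\log 10^4$, as a short evaluation of (60) shows. In the sketch below the gap of 1.14 eV and $\tau = 0.025$ eV are assumed inputs chosen for illustration; only the doping ratios come from the example above.

```python
import math


def built_in_energy_step(eps_g, tau, n_d_over_n_c, n_a_over_n_v):
    """Return e*V_bi from (60): eps_g - tau * log(n_c n_v / (n_d n_a))."""
    return eps_g + tau * math.log(n_d_over_n_c * n_a_over_n_v)


# Assumed inputs for illustration: a 1.14 eV gap and tau = 0.025 eV.
eps_g, tau = 1.14, 0.025
step = built_in_energy_step(eps_g, tau, 0.01, 0.01)
reduction_in_tau = (eps_g - step) / tau
print(f"e V_bi = eps_g - {reduction_in_tau:.2f} tau = {step:.3f} eV")
```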

A step in electrostatic potential is required to shift the band edge energies on 
the two sides of the junction relative to each other. The electrostatic potential 
$\varphi(x)$ must satisfy the Poisson equation 

$$\text{(SI)}\qquad \frac{d^2\varphi}{dx^2} = -\frac{\rho}{\epsilon}, \tag{61}$$

where $\rho$ is the space charge density and $\epsilon$ the permittivity of the semiconductor. 
Space charge must be present whenever $\varphi$ varies. In the vicinity of the junction 
the charge carriers no longer neutralize the impurities as in the bulk material. 
The space charge must be positive on the n side and negative on the p side 
(Figure 13.7d). Positive space charge on the n side means that the electron 
concentration is less than the donor concentration. Indeed, as the conduction 
band edge is raised relative to the fixed Fermi level, (17) predicts an exponential 
decrease of the electron concentration $n_e$. 

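Take the origin of the electrostatic potential at $x = -\infty$, so that $\varphi(-\infty) = 0$. 
Then $\varepsilon_c(x) = \varepsilon_c(-\infty) - e\varphi(x)$, and (17) becomes 

$$n_e(x) = n_d\exp[e\varphi(x)/\tau]. \tag{62}$$

The Poisson equation (61) is 

$$\frac{d^2\varphi}{dx^2} = -\frac{e}{\epsilon}[n_d - n_e(x)] = -\frac{en_d}{\epsilon}[1 - \exp(e\varphi/\tau)]. \tag{63}$$

Multiply by $2\,d\varphi/dx$ to obtain 

$$2\frac{d\varphi}{dx}\frac{d^2\varphi}{dx^2} = \frac{d}{dx}\left(\frac{d\varphi}{dx}\right)^2 = -\frac{2en_d}{\epsilon}\frac{d}{dx}\left[\varphi - \frac{\tau}{e}\exp(e\varphi/\tau)\right]. \tag{64}$$

Integrate with the initial condition $\varphi(-\infty) = 0$: 

$$\left(\frac{d\varphi}{dx}\right)^2 = -\frac{2en_d}{\epsilon}\left[\varphi + \frac{\tau}{e} - \frac{\tau}{e}\exp(e\varphi/\tau)\right]. \tag{65}$$

At the interface $x = 0$ we assume that 

$$-\varphi(0) \equiv V_n \gg \tau/e, \tag{66}$$

where $V_n$ is that part of the built-in electrostatic potential drop that occurs on 
the n side. The exponential on the right-hand side of (65) can be neglected, and 
we obtain 

$$E = [(2en_d/\epsilon)(V_n - \tau/e)]^{1/2} \tag{67}$$

for the x component of the electric field $E = -d\varphi/dx$ at the interface. Similarly, 

$$E = [(2en_a/\epsilon)(V_p - \tau/e)]^{1/2}, \tag{68}$$

where $V_p$ is that part of the built-in electrostatic potential drop that occurs on 
the p side. The two $E$ fields must be the same; from this and from $V_n + V_p = V_{bi}$ 
we find 

$$E = \left[\frac{2e}{\epsilon}\,\frac{n_an_d}{n_a + n_d}\left(V_{bi} - \frac{2\tau}{e}\right)\right]^{1/2}. \tag{69}$$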




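The field $E$ is the same as if on the n-type side all electrons had been depleted 
from the junction to a distance 

$$w_n = \frac{\epsilon E}{en_d} = \left[\frac{2\epsilon}{e}\,\frac{n_a}{n_d(n_a + n_d)}\left(V_{bi} - \frac{2\tau}{e}\right)\right]^{1/2}, \tag{70}$$

with no depletion at $|x| > w_n$. The distance $w_n$ is used in semiconductor device 
theory as a measure of the depth of penetration of the space charge transition 
layer into the n side. 

Similarly, on the p side, 

$$w_p = \frac{\epsilon E}{en_a} = \left[\frac{2\epsilon}{e}\,\frac{n_d}{n_a(n_a + n_d)}\left(V_{bi} - \frac{2\tau}{e}\right)\right]^{1/2}. \tag{71}$$

The total depletion width $w = w_n + w_p$ is 

$$w = \left[\frac{2\epsilon}{e}\,\frac{n_a + n_d}{n_an_d}\left(V_{bi} - \frac{2\tau}{e}\right)\right]^{1/2} = \frac{2(V_{bi} - 2\tau/e)}{E}. \tag{72}$$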

Ifwe assume n, = ny = 10'Scm~?;€ = 10€9; and Y; — 2t/e = 1 volt, we find 
E = 4.25 x 10*Vcm™! and w = 4.70 x 1075) 
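
Expressions (69) and (72) are straightforward to evaluate in SI units. The sketch below is an illustration with inputs chosen by us: equal doping of $10^{16}$ cm$^{-3}$ on both sides, $\epsilon = 10\epsilon_0$, and $V_{bi} - 2\tau/e = 1$ volt. The constants are rounded CODATA values typed in by hand, and the function name is hypothetical.

```python
import math

E_CHARGE = 1.602176634e-19    # C
EPS_0 = 8.8541878128e-12      # F / m


def junction_field_and_width(n_a_cm3, n_d_cm3, rel_permittivity, v_eff):
    """Return (E in V/cm, w in cm) from (69) and (72).

    v_eff stands for V_bi - 2 tau/e in volts; concentrations are in cm^-3.
    """
    n_a, n_d = n_a_cm3 * 1e6, n_d_cm3 * 1e6              # convert to m^-3
    eps = rel_permittivity * EPS_0
    field = math.sqrt(2.0 * E_CHARGE / eps * n_a * n_d / (n_a + n_d) * v_eff)  # V/m
    width = 2.0 * v_eff / field                           # m
    return field * 1e-2, width * 1e2


# Illustrative inputs chosen for this sketch: 1e16 cm^-3 on both sides, eps = 10 eps_0, 1 V.
field, width = junction_field_and_width(1e16, 1e16, 10.0, 1.0)
print(f"E = {field:.3e} V/cm, w = {width:.3e} cm")
```

The product of field and width equals $2(V_{bi} - 2\tau/e)$, as (72) requires, whatever the doping.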


Reverse-Biased Abrupt p-n Junction 


Let a voltage V be applied to a p-n junction, of such sign that the p side is at a 
negative voltage relative to the n side, which means that V raises the potential 
energy of the electrons on the p side. This voltage will drive conduction electrons 
from the p side to the n side, and holes from the n side to the p side. But the 
p side in bulk contains a very low concentration of conduction electrons, and the 
n side contains a very low concentration of holes, consistent with the mass 
action law. As a result, very little current flows. The distributions of electrons, 
holes, and potential are approximately the same as if the built-in voltage were 
increased by the applied voltage, Figure 13.8. The field at the interface is now 
given by 

$$E = \left[\frac{2e}{\epsilon}\,\frac{n_an_d}{n_a + n_d}\left(|V| + V_{bi} - \frac{2\tau}{e}\right)\right]^{1/2}, \tag{73}$$

and the junction thickness is given by 

$$w = \left[\frac{2\epsilon}{e}\,\frac{n_a + n_d}{n_an_d}\left(|V| + V_{bi} - \frac{2\tau}{e}\right)\right]^{1/2} = \frac{2(|V| + V_{bi} - 2\tau/e)}{E}. \tag{74}$$

In the semiconductor device literature we often find (73) and (74) without the 
term $2\tau/e$, because certain approximations have been made about the space 
charge and field distribution; we have solved the Poisson equation with the 
correct electron distribution (62). 

Figure 13.8 Reverse-biased p-n junction, showing the quasi-Fermi 
levels $\mu_n$ and $\mu_p$. 




NONEQUILIBRIUM SEMICONDUCTORS 


Quasi-Fermi Levels 


When a semiconductor is illuminated with light of quantum energy greater 
than the energy gap, electrons are raised from the valence band to the conduc- 
tion band. The electron and the hole concentrations created by illumination 
are larger than their equilibrium concentrations. Similar nonequilibrium con- 
centrations arise when a forward-biased p-n junction injects electrons into a 
p-type semiconductor or holes into an n-type semiconductor. The electric 
charge associated with the injected carrier type attracts oppositely charged 
carriers from the external electrodes of the semiconductor so that both carrier 
concentrations increase. 

The excess carriers eventually recombine with each other. The recombination 
times vary greatly with the semiconductor, from less than 10~°s to longer than 
10~*s. Recombination times in high purity Siare near 10~s. Even the shortest 
recombination times are much longer than the times ($\sim 10^{-12}$ s) required at 
room temperature for the conduction electrons to reach thermal equilibrium 
with each other in the conduction band, and for the holes to reach thermal 
equilibrium with each other in the valence band. Thus the orbital occupancy 
distributions of electrons and of holes are very close to equilibrium Fermi-Dirac 
distributions in each band separately, but the total number of holes is not in 
equilibrium with the total number of electrons. 

We can express this steady state or quasi-equilibrium condition by saying 
that there are different Fermi levels $\mu_c$ and $\mu_v$ for the two bands, called quasi- 
Fermi levels: 

$$f_c(\varepsilon,\tau) = \frac{1}{1 + \exp[(\varepsilon - \mu_c)/\tau]}; \qquad f_v(\varepsilon,\tau) = \frac{1}{1 + \exp[(\varepsilon - \mu_v)/\tau]}. \tag{75}$$


Quasi-Fermi levels are used extensively in the analysis of semiconductor devices. 


Current Flow: Drift and Diffusion 


If the conduction band quasi-Fermi level is at a constant energy throughout a 
semiconductor crystal, the conduction electrons throughout the crystal are in 
thermal and diffusive equilibrium, and no electron current will flow. Any 
conduction electron flow in a semiconductor at a uniform temperature must be 
caused by a position-dependence of the conduction band quasi-Fermi level. 
If the gradient of this level is sufficiently weak, we may assume that the 
contribution of conduction electrons to the total electrical current density is 
proportional to this gradient: 

$$\mathbf{J}_e \propto \operatorname{grad}\mu_c. \tag{76}$$

Here $\mathbf{J}_e$ is an electrical current density, not a particle flux density. Because each 
electron carries the charge $-e$, we have 

$$\mathbf{J}_e = (-e) \times (\text{electron flux density}), \tag{77}$$

where the electron flux density is defined as the number of conduction electrons 
crossing unit area in unit time. The close connection of (76) to Ohm's law is 
treated in Chapter 14. Because the flow of particles is from high to low chemical 
potential, the conduction electron flux is opposite to $\operatorname{grad}\mu_c$, but because 
electrons carry a negative charge, the associated electrical current density is 
in the direction of $\operatorname{grad}\mu_c$. We view $\operatorname{grad}\mu_c$ as the driving force for this current. 

For a given driving force, the current density is proportional to the concentration 
$n_e$ of conduction electrons. Thus we write 

$$\mathbf{J}_e = \tilde{\mu}_en_e\operatorname{grad}\mu_c, \tag{78}$$

where the proportionality constant $\tilde{\mu}_e$ is the electron mobility. The symbol $\tilde{\mu}_e$ 
should not be confused with the conduction band quasi-Fermi level, $\mu_c$. 

If the electron concentration is in the extrinsic but nondegenerate range, 

$$n_i \ll n_e \ll n_c, \tag{79}$$

the conduction band quasi-Fermi level is given by (17), which can be written 
in terms of the electron concentration as 

$$\mu_c = \varepsilon_c + \tau\log(n_e/n_c). \tag{80}$$

Thus (78) becomes 

$$\mathbf{J}_e = \tilde{\mu}_en_e\operatorname{grad}\varepsilon_c + \tilde{\mu}_e\tau\operatorname{grad}n_e. \tag{81}$$

A gradient in the conduction band edge arises from a gradient in the electrostatic 
potential and thus from an electric field: 

$$\operatorname{grad}\varepsilon_c = -e\operatorname{grad}\varphi = e\mathbf{E}. \tag{82}$$




We introduce an electron diffusion coefficient $D_e$ by the Einstein relation 

$$D_e \equiv \tilde{\mu}_e\tau/e, \tag{83}$$

discussed in Chapter 14. We now write (78) or (81) in the final form 

$$\mathbf{J}_e = e\tilde{\mu}_en_e\mathbf{E} + eD_e\operatorname{grad}n_e. \tag{84}$$

There are two different contributions to the current: one caused by an electric 
field and one caused by a concentration gradient. 

Analogous results apply to holes, with one difference. The valence band quasi- 
Fermi level is not the chemical potential for holes, but is the chemical potential 
for the electrons in the valence band. Holes are missing electrons; a hole current 
to the right is really an electron current to the left. But holes carry a positive 
rather than a negative charge. The two sign reversals cancel, and we may view 
$\operatorname{grad}\mu_v$ as the driving force for the contribution $\mathbf{J}_h$ of holes to the total electrical 
current density. We write, analogously to (78), 

$$\mathbf{J}_h = \tilde{\mu}_hn_h\operatorname{grad}\mu_v. \tag{85}$$

Carrying through the rest of the argument leads to 

$$\mathbf{J}_h = +e\tilde{\mu}_hn_h\mathbf{E} - eD_h\operatorname{grad}n_h \tag{86}$$

as the analog of (84), with the Einstein relation $D_h = \tilde{\mu}_h\tau/e$. Note the different 
sign in the diffusion term: Holes, like electrons, diffuse from high to low 
concentrations, but hole diffusion makes the opposite contribution to the electric 
current, because holes carry the opposite charge. 
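
A useful consistency check on (84) is that drift and diffusion cancel in equilibrium, where the quasi-Fermi level is constant and the concentration follows the potential as in (62). The sketch below tests this in one dimension for a uniform field; the field, mobility, and concentration values are hypothetical, the gradient is taken by a finite difference, and the common factor $e$ is left out of the current.

```python
import math


def electron_current_density(mobility, n_e, field, grad_n, tau_over_e):
    """One-dimensional form of (84) divided by e: mu*n_e*E + D_e*dn_e/dx.

    tau_over_e is tau/e in volts; D_e = mobility * tau_over_e is the Einstein relation (83).
    """
    diffusion = mobility * tau_over_e
    return mobility * n_e * field + diffusion * grad_n


# Hypothetical equilibrium profile: phi(x) = -F*x, so E = -dphi/dx = F, and by (62)
# n_e(x) = n0 * exp(phi / (tau/e)).
tau_over_e, F, n0, mobility = 0.025, 100.0, 1.0e16, 0.1
x, dx = 1.0e-4, 1.0e-9


def n_profile(pos):
    return n0 * math.exp(-F * pos / tau_over_e)


grad_n = (n_profile(x + dx) - n_profile(x - dx)) / (2.0 * dx)   # central difference
drift = mobility * n_profile(x) * F
total = electron_current_density(mobility, n_profile(x), F, grad_n, tau_over_e)
print(f"drift term: {drift:.4e}, total: {total:.4e}")
```

The total is smaller than the drift term alone by many orders of magnitude, limited only by the finite-difference error.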


Example: Injection laser. The highest nonequilibrium carrier concentrations in semi- 
conductors occur in injection lasers. When by electron injection the occupation $f_c(\varepsilon_c)$ of 
the lowest conduction band orbital becomes higher than the occupation $f_v(\varepsilon_v)$ of the highest 
valence band orbital, the population is said to be inverted. Laser theory tells us that light 
with a quantum energy $\varepsilon_c - \varepsilon_v = \varepsilon_g$ can then be amplified by stimulated emission. The 
condition for population inversion is that 

$$f_c(\varepsilon_c) > f_v(\varepsilon_v). \tag{87}$$

With the quasi-Fermi distributions (75) this condition is expressed as 

$$\mu_c - \mu_v > \varepsilon_c - \varepsilon_v = \varepsilon_g. \tag{88}$$
Figure 13.9 Double-heterostructure injection laser. Electrons flow from the 
right into the active layer, where they form a degenerate electron gas. The 
potential barrier provided by the wide energy gap on the p side prevents the 
electrons from escaping to the left. Holes flow from the left into the active 
layer, but cannot escape to the right. When (88) is attained, laser action 
becomes possible. 


For laser action the quasi-Fermi levels must be separated by more than the energy gap. 
The condition (88) requires that at least one of the quasi-Fermi levels lie inside the band 
to which it refers. This is a necessary, but not a sufficient condition for laser operation. 
An important additional condition is that the energy gap is a direct gap rather than an 
indirect gap. The distinction is treated in solid state physics texts. The most important 
semiconductors with a direct gap are GaAs and InP. 
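
The equivalence of (87) and (88) can be checked with the quasi-Fermi distributions (75). The following sketch does so for a few made-up pairs of quasi-Fermi levels (energies in eV, with an assumed gap of 1.4 eV and $\tau = 0.025$ eV); it tests only the inversion condition, not the further requirements for laser operation.

```python
import math


def occupancy(eps, mu, tau):
    """Quasi-Fermi distribution of (75) for a band with quasi-Fermi level mu."""
    return 1.0 / (1.0 + math.exp((eps - mu) / tau))


def is_inverted(mu_c, mu_v, eps_c, eps_v, tau):
    """Population inversion test (87): f_c(eps_c) > f_v(eps_v)."""
    return occupancy(eps_c, mu_c, tau) > occupancy(eps_v, mu_v, tau)


# Made-up energies in eV: gap 1.4, tau = 0.025.
eps_v, eps_c, tau = 0.0, 1.4, 0.025
cases = [(1.45, 0.02), (1.38, -0.03), (1.35, 0.05)]   # (mu_c, mu_v) pairs
results = []
for mu_c, mu_v in cases:
    by_levels = mu_c - mu_v > eps_c - eps_v           # condition (88)
    by_occupancy = is_inverted(mu_c, mu_v, eps_c, eps_v, tau)
    results.append((by_levels, by_occupancy))
    print(mu_c, mu_v, by_levels, by_occupancy)
```

In the second case only one of the two quasi-Fermi levels lies inside its band, yet the separation exceeds the gap and the population is inverted.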

The population inversion is most easily achieved in the double heterostructure of Fig- 
ure 13.9; here the lasing semiconductor is embedded between two wider-gap semiconductor 
regions of opposite doping. An example is GaAs embedded in AlAs. In such a structure 
there is a potential barrier that prevents the outflow of electrons to the p-type region, and 
an opposite potential barrier that prevents the outflow of holes to the n-type region. 
Except for the current caused by the recombination itself, the electrons in the active layer 
are in diffusive equilibrium with the electrons in the n contact, and the electron quasi- 
Fermi level in the active layer lines up with the Fermi level in the n contact. Similarly, the 
valence band quasi-Fermi level lines up with the Fermi level in the p contact. Inversion 
can be achieved if we apply a bias voltage larger than the voltage equivalent of the active 
layer energy gap. Most injection lasers utilize this double heterostructure principle. 




Example: Carrier recombination through an impurity level. Electrons and holes can
recombine either by an electron falling directly into a hole with the emission of a photon,
or through an impurity level in the energy gap. The impurity process
is dominant in silicon. We discuss the process as an instructive example of quasi-equilibrium
semiconductor statistics. Consider an impurity recombination orbital at energy $\varepsilon_r$ in
Figure 13.10. Four transition processes are indicated in the figure. We assume that the
rate $R_{cr}$ at which conduction electrons fall into the recombination orbitals is described by
a law of the form

$$R_{cr} = (1 - f_r)\,n_e/t_e, \tag{89}$$

where $f_r$ is the fraction of recombination orbitals already occupied by an electron (and
hence not available), and $t_e$ is a characteristic time constant for the capture process. We
assume the reverse process proceeds at the rate

$$R_{rc} = f_r/t_e', \tag{90}$$


where $t_e'$ is the time constant for the reverse process. We take $R_{rc}$ independent of the
concentration of conduction electrons, because we assume that $n_e \ll n_c$.

Figure 13.10 Electron-hole recombination through impurity recombination orbitals at $\varepsilon_r$ inside the energy gap.

The time constants $t_e$ and $t_e'$ are related, because in equilibrium the two rates $R_{cr}$ and $R_{rc}$ must cancel. Thus

$$\frac{t_e}{t_e'} = \frac{n_e(1 - f_r)}{f_r}, \tag{91}$$


with $f_r$ and $n_e$ evaluated in thermal equilibrium, which means we use (17) for $n_e$. We ignore
the spin multiplicity of the recombination levels. With the equilibrium Fermi-Dirac
distribution for $f_r$, we have

$$(1 - f_r)/f_r = \exp[-(\mu - \varepsilon_r)/\tau]. \tag{92}$$

Thus (91) becomes

$$\frac{t_e}{t_e'} = n_c\exp[-(\varepsilon_c - \varepsilon_r)/\tau] \equiv n_e^*, \tag{93}$$


where $n_e^*$ is defined as the conduction electron concentration that would be present if the
equilibrium Fermi level in (17) coincided with the recombination level. If (92) and (93)
are inserted into (89) and (90), the net electron recombination rate becomes

$$R_e = R_{cr} - R_{rc} = [(1 - f_r)\,n_e - f_r\,n_e^*]/t_e. \tag{94}$$


The analogous recombination rate for holes is obtained by the substitutions

$$R_e,\ n_e,\ n_e^*,\ t_e \;\to\; R_h,\ n_h,\ n_h^*,\ t_h$$

and

$$f_r \to 1 - f_r; \qquad 1 - f_r \to f_r. \tag{95}$$

Here $t_h$ is the lifetime of holes in the limit that all recombination centers are occupied by
electrons, and $n_h^*$ is, by definition,

$$n_h^* = n_v\exp[-(\varepsilon_r - \varepsilon_v)/\tau] = n_i^2/n_e^*. \tag{96}$$

With these substitutions the net hole recombination rate is

$$R_h = [f_r\,n_h - (1 - f_r)\,n_h^*]/t_h. \tag{97}$$


In steady state the two recombination rates must be equal: $R_e = R_h = R$. Equations (94)
and (97) are two equations for the two unknowns $f_r$ and $R$. We eliminate $f_r$ to find

$$R = \frac{n_e n_h - n_i^2}{(n_e^* + n_e)\,t_h + (n_h^* + n_h)\,t_e}. \tag{98}$$

This is the basic result of the Hall-Shockley-Read recombination theory.* Applications
are developed in Problems 10 and 11.
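The elimination of the occupied fraction can be checked numerically. The following is a minimal sketch, not part of the original text: the function names and the sample concentrations and time constants are assumptions chosen only for illustration. It evaluates the net rate (98), solves the steady-state condition for the occupied fraction, and confirms that the electron rate (94) and the hole rate (97) then agree with (98).

```python
def srh_rate(n_e, n_h, n_i, n_e_star, t_e, t_h):
    """Net steady-state recombination rate R of Eq. (98)."""
    n_h_star = n_i * n_i / n_e_star  # Eq. (96)
    return (n_e * n_h - n_i * n_i) / (
        (n_e_star + n_e) * t_h + (n_h_star + n_h) * t_e
    )


def srh_occupancy(n_e, n_h, n_i, n_e_star, t_e, t_h):
    """Occupied fraction f_r that makes R_e of (94) equal to R_h of (97)."""
    n_h_star = n_i * n_i / n_e_star
    return (n_e * t_h + n_h_star * t_e) / (
        (n_e_star + n_e) * t_h + (n_h_star + n_h) * t_e
    )


# Assumed illustrative values: concentrations in cm^-3, time constants in s.
n_i = 1.0e10
n_e, n_h = 1.0e16, 1.0e14   # nonequilibrium: n_e * n_h > n_i^2
n_e_star = n_i              # recombination level at the intrinsic Fermi level
t_e = t_h = 1.0e-6

f_r = srh_occupancy(n_e, n_h, n_i, n_e_star, t_e, t_h)
R = srh_rate(n_e, n_h, n_i, n_e_star, t_e, t_h)
R_e = ((1 - f_r) * n_e - f_r * n_e_star) / t_e                  # Eq. (94)
R_h = (f_r * n_h - (1 - f_r) * (n_i * n_i / n_e_star)) / t_h    # Eq. (97)
print(f_r, R, R_e, R_h)
```

When the product of the carrier concentrations equals the square of the intrinsic concentration, the numerator of (98) vanishes and the net rate is zero, as required in equilibrium.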


SUMMARY 


1. In semiconductors the electron orbitals are grouped into a valence band 
(completely occupied at t = 0 in a pure semiconductor) and a conduction 
band (completely empty at t = 0 in a pure semiconductor), separated by 
an energy gap. Electrons in the conduction band are called conduction 
electrons; empty orbitals in the valence band are called holes. 


2. The probability of occupancy of a band orbital with energy $\varepsilon$ is governed
by the Fermi-Dirac distribution function

$$f(\varepsilon) = \frac{1}{1 + \exp[(\varepsilon - \mu)/\tau]}.$$

Here $\mu$ is the chemical potential of the electrons, called the Fermi level.

3. The energetic location of the Fermi level in an electrically neutral semiconductor
is governed by the neutrality condition

$$n_e - n_h = \Delta n.$$

Here $n_e$ and $n_h$ are the concentrations of conduction electrons and holes,
and $\Delta n$ is the excess concentration of positively charged impurities over
negatively charged impurities.

4. A semiconductor is said to be in the classical regime when $n_e \ll n_c$ and
$n_h \ll n_v$. Here

$$n_{c,v} = 2(m_{e,h}^*\tau/2\pi\hbar^2)^{3/2}$$

are the quantum concentrations for electrons and holes; $m_e^*$ and $m_h^*$ are
effective masses for electrons and holes. In the semiconductor literature,
$n_c$ and $n_v$ are called the effective densities of states for the conduction and
valence bands.


* R. N. Hall, Phys. Rev. 87, 387 (1952); W. Shockley and W. T. Read, Jr., Phys. Rev. 87, 835 (1952).
5. In the classical regime

$$n_e = n_c\exp[-(\varepsilon_c - \mu)/\tau], \qquad n_h = n_v\exp[-(\mu - \varepsilon_v)/\tau],$$

where $\varepsilon_c$ and $\varepsilon_v$ are the energies of the edges of the conduction and valence
bands.

6. The mass action law states that in the classical regime the product

$$n_e n_h = n_i^2 = n_c n_v\exp(-\varepsilon_g/\tau)$$

is independent of the impurity concentration. The intrinsic concentration
$n_i$ is the common value of $n_e$ and $n_h$ in an intrinsic (= pure) semiconductor.
The quantity

$$\varepsilon_g = \varepsilon_c - \varepsilon_v$$

is the energy gap.


7. A semiconductor is called n-type when negative charge carriers (= conduction
electrons) dominate; it is called p-type when positive charge
carriers (= holes) dominate. The sign of the dominant charge carriers is
opposite to the sign of the dominant ionized impurities.


8. A p-n junction is a rectifying semiconductor structure with an internal
transition from p-type to n-type. A p-n junction contains internal electric
fields even in the absence of an applied voltage. For an abrupt junction the
field at the p-n interface is

$$E = \left[\frac{2e}{\epsilon}\,\frac{n_a n_d}{n_a + n_d}\left(|V| + V_{bi} - \frac{2\tau}{e}\right)\right]^{1/2}.$$

Here $\epsilon$ is the permittivity, $n_a$ and $n_d$ are ionized acceptor and donor
concentrations, and $|V|$ and $V_{bi}$ are the applied and the built-in reverse bias.


9. The electric current densities due to electron and hole flow are given by

$$J_e = e\tilde\mu_e n_e E + eD_e\,\mathrm{grad}\,n_e,$$

$$J_h = e\tilde\mu_h n_h E - eD_h\,\mathrm{grad}\,n_h.$$

Here $\tilde\mu_e$ and $\tilde\mu_h$ are the electron and hole mobilities, and

$$D_e = \tilde\mu_e\tau/e, \qquad D_h = \tilde\mu_h\tau/e$$

are the electron and hole diffusion coefficients.


PROBLEMS 


1. Weakly doped semiconductor. Calculate the electron and hole concentrations
when the net donor concentration is small compared to the intrinsic
concentration, $|\Delta n| \ll n_i$.


2. Intrinsic conductivity and minimum conductivity. The electrical conductivity
is

$$\sigma = e(n_e\tilde\mu_e + n_h\tilde\mu_h), \tag{99}$$

where $\tilde\mu_e$ and $\tilde\mu_h$ are the electron and hole mobilities. For most semiconductors
$\tilde\mu_e > \tilde\mu_h$. (a) Find the net ionized impurity concentration $\Delta n = n_d^+ - n_a^-$ for
which the conductivity is a minimum. Give a mathematical expression for this
minimum conductivity. (b) By what factor is it lower than the conductivity of an
intrinsic semiconductor? (c) Give numerical values at 300 K for Si, for which the
mobilities are $\tilde\mu_e = 1350$ and $\tilde\mu_h = 480$ cm$^2$ V$^{-1}$ s$^{-1}$, and for InSb, for which
the mobilities are $\tilde\mu_e = 77000$ and $\tilde\mu_h = 750$ cm$^2$ V$^{-1}$ s$^{-1}$. Calculate missing
data from Table 13.1.


3. Resistivity and impurity concentration. A manufacturer specifies the
resistivity $\rho = 1/\sigma$ of a Ge crystal as 20 ohm cm. Take $\tilde\mu_e = 3900$ cm$^2$ V$^{-1}$ s$^{-1}$
and $\tilde\mu_h = 1900$ cm$^2$ V$^{-1}$ s$^{-1}$. What is the net impurity concentration (a) if the
crystal is n-type; (b) if the crystal is p-type?


4. Mass action law for high electron concentrations. Derive (39), which is the
form of the law of mass action when $n_e$ is no longer small compared to $n_c$.


5. Electron and hole concentrations in InSb. Calculate $n_e$, $n_h$, and $\mu - \varepsilon_c$ for
n-type InSb at 300 K, assuming $n_d^+ = 4.6 \times 10^{16}$ cm$^{-3}$ $= n_c$. Because of the
high ratio $n_v/n_c$ and the narrow energy gap, the hole concentration is not
negligible under these conditions, nor is the nondegenerate approximation
$n_e \ll n_c$ applicable. Use the generalized mass action law (39). Solve the
transcendental equation for $n_e$ by iteration or graphically.


6. Incomplete ionization of deep impurities. (a) Find the fraction of ionized 
donor impurities if the donor ionization energy is large enough that Ae, is 
larger than tlog(8n,/n.) by several times t. The result explains why substances 
with large impurity ionization energies remain insulators, even if impure. 
(b) If we draw the line between an insulator and a semiconductor at n, = 10° 
cm, find the donor ionization energy above which the most impure substance 
will be an insulator. 


7. Built-in field for exponential doping profile. Suppose that in a p-type
semiconductor the ionized acceptor concentration at $x = x_1$ is $n_a^- = n_1 \ll n_v$
and falls off exponentially to a value $n_a^- = n_2 \gg n_i$ at $x = x_2$. What is the
built-in electric field in the interval $(x_1, x_2)$? Give numerical values for $n_1/n_2 =$
10° and x. — x; = 10~5cm. Assume T = 300K. Impurity distributions such 
as this occur in the base region of many n-p-n transistors. The built-in field 
aids in driving the injected electrons across the base. 


8. Einstein relation for high electron concentrations. Use the Joyce-Dixon
approximation (38) to give a series expansion of the ratio $D_e/\tilde\mu_e$ for electron
concentrations approaching or exceeding $n_c$.


9. Injection laser. Use the Joyce-Dixon approximation to calculate at T = 
300K the electron-hole pair concentration in GaAs that satisfies the inversion 
condition (88), assuming no ionized impurities. 


10. Minority carrier lifetime. Assume both electron and hole concentrations
in a semiconductor are raised by $\delta n$ above their equilibrium values. Define a net
minority carrier lifetime $t$ by $R = \delta n/t$. Give expressions for $t$ in terms of the
carrier concentrations $n_e$ and $n_h$; the energy of the recombination level, as
expressed by $n_e^*$ and $n_h^*$; and the time constants $t_e$ and $t_h$, in the limits of very
small and very large values of $\delta n$. Under what doping conditions is $t$ independent
of $\delta n$?


11. Electron-hole pair generation. Inside a reverse biased p-n junction both
electrons and holes have been swept out. (a) Calculate the electron-hole pair
generation rate under these conditions, assuming $n_e^* = n_h^*$ and $t_e = t_h = t$.
(b) Find the factor by which this generation rate is higher than the generation
rate in an n-type semiconductor from which the holes have been swept out,
but in which the electron concentration remains equal to $n_d^+ \gg n_i$. (c) Give a
numerical value for this ratio for Si with $n_d^+ = 10^{16}$ cm$^{-3}$.
I am conscious of being only an individual struggling weakly against the stream 
of time. But it still remains in my power to contribute in such a way that, when 
the theory of gases is again revived, not too much will have to be rediscovered. 


L. Boltzmann 




In this chapter we give a kinetic derivation of the ideal gas law, the distribution 
of velocities of gas molecules, and transport processes in gases: diffusion, 
thermal conductivity, and viscosity. The Boltzmann transport equation is 
discussed. We also treat gases at very low pressures, with reference to vacuum 
pumps. The chapter is essentially classical physics because the quantum theory 
of transport is difficult. 


KINETIC THEORY OF THE IDEAL GAS LAW 


We apply the kinetic method to obtain an elementary derivation of the ideal gas
law, $pV = N\tau$. Consider molecules that strike a unit area of the wall of a
container. Let $v_z$ denote the velocity component normal to the plane of the wall,
as in Figure 14.1. If a molecule of mass $M$ is reflected specularly (mirror-like)
from the wall, the change of momentum of the molecule is

$$-2M|v_z|. \tag{1}$$

This gives an impulse $2M|v_z|$ to the wall, by Newton's second law of motion. The
pressure on the wall is

$$p = (\text{momentum change per molecule}) \times (\text{number of molecules striking unit area per unit time}). \tag{2}$$

Let $a(v_z)\,dv_z$ be the number of molecules per unit volume with the z component
of the velocity between $v_z$ and $v_z + dv_z$. Here $\int a(v_z)\,dv_z = N/V = n$. The
number in this velocity range that strike a unit area of the wall in unit time is
$a(v_z)v_z\,dv_z$. The momentum change of these molecules is $-2Mv_z\,a(v_z)v_z\,dv_z$, so
that the total pressure is

$$p = \int_0^\infty 2Mv_z^2\,a(v_z)\,dv_z = M\int_{-\infty}^{\infty} v_z^2\,a(v_z)\,dv_z. \tag{3}$$


Figure 14.1 The change of momentum of a molecule of velocity $v$ which is reflected from
the wall of the container is $-2M|v_z|$.

The integral on the right of (3) is the thermal average of $v_z^2$ times the concentration,
so that $p = Mn\langle v_z^2\rangle$. The average value of $\tfrac12Mv_z^2$ is $\tfrac12\tau$, by equipartition of
energy (Chapter 3). Thus the pressure is

$$p = nM\langle v_z^2\rangle = n\tau = (N/V)\tau; \qquad pV = N\tau. \tag{4}$$


This is the ideal gas law. 

The assumption of specular reflection is convenient, but it is immaterial to the 
result. What comes into the surface must go back, with the same distribution, 
if thermal equilibrium is to be maintained. 
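The integral in (3) can be evaluated numerically as a check. The sketch below is an illustration only: it assumes a Gaussian form for the distribution of the normal velocity component (the form that follows from equipartition), works in arbitrary units, and uses a hypothetical function name. The result should equal the concentration times the temperature, as in (4).

```python
import math


def pressure_from_kinetics(n=1.0, M=1.0, tau=1.0, steps=20000, width=12.0):
    """Trapezoidal estimate of p = integral_0^inf 2 M v^2 a(v) dv, Eq. (3).

    Assumed distribution of the normal velocity component:
    a(v) = n * sqrt(M / (2 pi tau)) * exp(-M v^2 / (2 tau)).
    """
    v_max = width * math.sqrt(tau / M)   # cut the integral off far in the tail
    dv = v_max / steps
    total = 0.0
    for i in range(steps + 1):
        v = i * dv
        a = n * math.sqrt(M / (2.0 * math.pi * tau)) * math.exp(-M * v * v / (2.0 * tau))
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * 2.0 * M * v * v * a * dv
    return total


p_unit = pressure_from_kinetics()                       # n = M = tau = 1
p_other = pressure_from_kinetics(n=2.5, M=4.0, tau=3.0)
print(p_unit, p_other)
```

The molecular mass drops out of the result, which is why the ideal gas law does not depend on the kind of molecule.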


Maxwell Distribution of Velocities 


We now transform the energy distribution function of an ideal gas into a
classical velocity distribution function. Often when we mean “speed” we shall
say “velocity”, as this is the tradition in physics when no confusion is caused.
In Chapter 6 we found the distribution function of an ideal gas to be

$$f(\varepsilon_n) = \lambda\exp(-\varepsilon_n/\tau), \tag{5}$$

where $f(\varepsilon_n)$ is the probability of occupancy of an orbital of energy

$$\varepsilon_n = \frac{\hbar^2}{2M}\left(\frac{\pi n}{L}\right)^2 \tag{6}$$


in a cube of volume $V = L^3$. The average number of atoms with quantum
number between $n$ and $n + dn$ is (the number of orbitals in this range) × (the
probability such an orbital is occupied). The number of orbitals in the positive
octant of a spherical shell of thickness $dn$ is $\tfrac18(4\pi n^2)\,dn$, whence the desired
product is

$$(\tfrac12\pi n^2\,dn)\,f(\varepsilon_n) = \tfrac12\pi\lambda n^2\exp(-\varepsilon_n/\tau)\,dn. \tag{7}$$


We take the spin of the atom as zero.

To obtain the probability distribution of the classical velocity, we must
find a connection between the quantum number $n$ and the classical velocity
of a particle in the orbital $\varepsilon_n$. The classical kinetic energy $\tfrac12Mv^2$ is related to the
quantum energy (6) by

$$\tfrac12Mv^2 = \frac{\hbar^2}{2M}\left(\frac{\pi n}{L}\right)^2; \qquad n = \frac{ML}{\pi\hbar}\,v. \tag{8}$$

We consider a system of $N$ particles in volume $V$. Let $NP(v)\,dv$ be the number of
atoms with velocity magnitude, or speed, in the range $dv$ at $v$. This is evaluated
from (7) and (8) by setting $dn = (dn/dv)\,dv = (ML/\hbar\pi)\,dv$. We have

$$NP(v)\,dv = \tfrac12\pi\lambda n^2\exp(-\varepsilon_n/\tau)\,\frac{dn}{dv}\,dv
= \tfrac12\pi\lambda\left(\frac{ML}{\pi\hbar}\right)^3 v^2\exp(-Mv^2/2\tau)\,dv. \tag{9}$$


From Chapter 6 we know that $\lambda = n/n_Q = (N/L^3)(2\pi\hbar^2/M\tau)^{3/2}$, so that the
factor standing to the left of $v^2$ becomes

$$\frac{\pi N(2\pi)^{3/2}\hbar^3M^3L^3}{2L^3M^{3/2}\tau^{3/2}\pi^3\hbar^3} = 4\pi N\left(\frac{M}{2\pi\tau}\right)^{3/2}. \tag{10}$$

Thus

$$P(v) = 4\pi(M/2\pi\tau)^{3/2}\,v^2\exp(-Mv^2/2\tau). \tag{11}$$

This is the Maxwell velocity distribution (Figure 14.2). The quantity $P(v)\,dv$ is the
probability that a particle has its speed in $dv$ at $v$. Numerical values of the root
mean square thermal velocity and the mean speed are given in Table 14.1,
using the results $v_{rms} = (3\tau/M)^{1/2}$ and $\bar c = (8\tau/\pi M)^{1/2}$ from Problem 1.
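The normalization of (11) and the two characteristic speeds can be verified by direct numerical integration. This is a sketch under stated assumptions: helium at 273 K is chosen as the example, the physical constants are standard SI values, and the helper names are hypothetical.

```python
import math

K_B = 1.380649e-23     # Boltzmann constant, J/K
AMU = 1.66053907e-27   # atomic mass unit, kg


def maxwell_P(v, M, tau):
    """Maxwell speed distribution of Eq. (11); tau is k_B T in joules."""
    return 4.0 * math.pi * (M / (2.0 * math.pi * tau)) ** 1.5 * v * v * math.exp(
        -M * v * v / (2.0 * tau)
    )


def trapezoid(f, a, b, steps=20000):
    """Plain trapezoidal rule on [a, b]."""
    h = (b - a) / steps
    total = 0.5 * (f(a) + f(b))
    for i in range(1, steps):
        total += f(a + i * h)
    return total * h


M = 4.0026 * AMU               # helium atom (assumed example)
tau = K_B * 273.0
v_max = 12.0 * math.sqrt(tau / M)   # far enough into the tail

norm = trapezoid(lambda v: maxwell_P(v, M, tau), 0.0, v_max)
mean_speed = trapezoid(lambda v: v * maxwell_P(v, M, tau), 0.0, v_max)
v_rms = math.sqrt(trapezoid(lambda v: v * v * maxwell_P(v, M, tau), 0.0, v_max))
print(norm, mean_speed, v_rms)   # speeds in m/s
```

For helium the speeds come out near $1.3 \times 10^5$ and $1.2 \times 10^5$ cm s$^{-1}$, in line with the entries of Table 14.1.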
Table 14.1 Molecular velocities at 273 K, in $10^4$ cm s$^{-1}$


Gas     v_rms    c̄        Gas              v_rms    c̄

H2      18.4     16.9     O2               4.6      4.2
He      13.1     12.1     Ar               4.3      4.0
H2O     6.2      5.7      Kr               2.86     2.63
Ne      5.8      5.3      Xe               2.27     2.09
N2      4.9      4.5      Free electron    1100.    1013.


Experimental verification. The velocity distribution of atoms of potassium
which exit from the slit of an oven has been studied by Marcus and McFee.*
The curve in Figure 14.3 compares the experimental results with the prediction
of (12) below; the agreement is excellent. We need an expression for the velocity
distribution of atoms that exit from a small hole† in an oven. This distribution is
different from the velocity distribution within the oven, because the flux through
the hole involves an extra factor, the velocity component normal to the wall.
The exit beam is weighted in favor of atoms of high velocity at the expense of
those at low velocity. In proportion to their concentration in the oven, fast atoms
strike the walls more often than slow atoms strike the walls. The weight factor
is the velocity component $v\cos\theta$ normal to the plane of the hole. The average
of $\cos\theta$ over the forward hemisphere is just a numerical factor, namely $\tfrac12$. The
probability that an atom which leaves the hole will have a velocity between
$v$ and $v + dv$ defines the quantity $P_{beam}(v)\,dv$, where

$$P_{beam}(v) \propto vP_{Maxwell}(v) \propto v^3\exp(-Mv^2/2\tau), \tag{12}$$

with $P_{Maxwell}$ given by (11). The distribution (12) of the transmission through a
hole is called the Maxwell transmission distribution.


Figure 14.3 Measured transmission points and calculated
Maxwell transmission curve for potassium atoms that exit from
an oven at a temperature 157°C. The horizontal axis is the transit
time of the atoms transmitted. The intensity is in arbitrary units;
the curve and the points are normalized to the same maximum
value. After Marcus and McFee.


* P. M. Marcus and J. H. McFee, Recent research in molecular beams, ed. I. Estermann, Academic
Press, 1959.

† In such experiments a round hole is said to be small if the diameter is less than a mean free path
of an atom in the oven. If the hole is not small in this sense, the flow of gas from it will be governed
by the laws of hydrodynamic flow and not by gas kinetics.


Collision Cross Sections and Mean Free Paths 


We can estimate the collision rates of gas atoms viewed as rigid spheres. Two
atoms of diameter $d$ will collide if their centers pass within the distance $d$ of
each other. From Figure 14.4 we see that one collision will occur when an atom
has traversed an average distance

$$l = \frac{1}{n\pi d^2}, \tag{13}$$

where $n$ is the number of atoms per unit volume. The length $l$ is called the mean
free path: it is the average distance traveled by an atom between collisions. Our
result neglects the velocity of the target atoms.


Figure 14.4 (a) Two rigid spheres will collide
if their centers pass within a distance $d$ of
each other. (b) An atom of diameter $d$ which
travels a long distance $L$ will sweep out a
volume $\pi d^2L$, in the sense that it will collide
with any atom whose center lies within the
volume. If $n$ is the concentration of atoms, the
average number of atoms in this volume is
$n\pi d^2L$. This is the number of collisions. The
average distance between collisions is
$l = L/(n\pi d^2L) = 1/(n\pi d^2)$.

We estimate the order of magnitude of the mean free path. If the atomic
diameter $d$ is 2.2 Å as for helium, then the collision cross section $\sigma_c$ is

$$\sigma_c = \pi d^2 = (3.14)(2.2 \times 10^{-8}\,\text{cm})^2 = 15.2 \times 10^{-16}\,\text{cm}^2. \tag{14}$$

The concentration of molecules of an ideal gas at 0°C and 1 atm is given by the
Loschmidt number

$$n_0 = 2.69 \times 10^{19}\,\text{atoms cm}^{-3}, \tag{15}$$
defined as the Avogadro number divided by the molar volume at 0°C and 1 atm.
The Avogadro number is the number of molecules in one mole; the molar
volume is the volume occupied by one mole. We combine (14) and (15) to obtain
the mean free path under standard conditions:

$$l = \frac{1}{\pi d^2n_0} = \frac{1}{(15.2 \times 10^{-16}\,\text{cm}^2)(2.69 \times 10^{19}\,\text{cm}^{-3})} = 2.44 \times 10^{-5}\,\text{cm}. \tag{16a}$$

This length is about 1000 times larger than the diameter of an atom. The
associated collision rate is

$$\frac{\bar c}{l} \approx \frac{10^5\,\text{cm s}^{-1}}{2.44 \times 10^{-5}\,\text{cm}} \approx 4 \times 10^9\,\text{s}^{-1}. \tag{16b}$$


At a pressure of $10^{-6}$ atm or 1 dyne cm$^{-2}$, the concentration of atoms is
reduced by $10^{-6}$ and the mean free path is increased to 25 cm. At $10^{-6}$ atm the
mean free path may not be small in comparison with the dimensions of any
particular experimental apparatus. Then we are in what is called the high vacuum
region, also called the Knudsen region. We assume below that the mean free
path is small in comparison with the relevant dimension of the apparatus,
except in the section on laws of rarefied gases.
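The arithmetic of (13) to (16) is easy to reproduce. The sketch below uses the helium diameter and the Loschmidt number quoted in the text; the mean speed of $1.2 \times 10^5$ cm s$^{-1}$ is an assumed round value taken from the helium entry of Table 14.1, and the function name is hypothetical.

```python
import math


def mean_free_path(d_cm, n_cm3):
    """Mean free path l = 1 / (n pi d^2) of Eq. (13), in cm."""
    return 1.0 / (n_cm3 * math.pi * d_cm ** 2)


D_HE = 2.2e-8        # atomic diameter of helium, cm (from the text)
N_0 = 2.69e19        # Loschmidt number, atoms per cm^3 (from the text)
C_BAR = 1.2e5        # assumed mean speed of helium at 273 K, cm/s

l_std = mean_free_path(D_HE, N_0)           # standard conditions
rate_std = C_BAR / l_std                    # collisions per second
l_vac = mean_free_path(D_HE, N_0 * 1.0e-6)  # at 1e-6 atm, same temperature

print(f"l at 1 atm     = {l_std:.3g} cm")
print(f"collision rate = {rate_std:.3g} per s")
print(f"l at 1e-6 atm  = {l_vac:.3g} cm")
```

Because the mean free path scales inversely with the concentration, each factor of ten in pressure reduction lengthens it by the same factor.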


TRANSPORT PROCESSES 


Consider a system not in thermal equilibrium, but in a nonequilibrium steady 
state with a constant flow from one end of the system to the other. For example, 
we may create a steady state nonequilibrium condition in a system by placing 
opposite ends in thermal contact with large reservoirs at two different tempera- 
tures. If reservoir 1 is at the higher temperature, energy will flow through the 
system from reservoir 1 to reservoir 2. Energy flow in this direction will increase
the total entropy of reservoir 1 + reservoir 2 + system. The temperature 
gradient in the system is the driving force; the physical quantity that is trans- 
ported through the specimen in this process is energy. 

There is a linear region in most transport processes in which the flux is 
directly proportional to the driving force: 


flux = (coefficient) x (driving force) , (17) 
provided the force is not too large. Such a relation is called a linear phenome- 
nological law, such as Ohm’s law for the conduction of electricity. The definition 


of the flux density of a quantity $A$ is:

$$J_A = \text{flux density of } A = \text{net quantity of } A \text{ transported across unit area in unit time}. \tag{18}$$


The net transport is the transport in one direction minus the transport in the 
opposite direction. Various transport laws are summarized in Table 14.2.


Particle Diffusion 


In Figure 14.5 we consider a system with one end in diffusive contact with a
reservoir at chemical potential $\mu_1$; the other end is in diffusive contact with a
reservoir at chemical potential $\mu_2$. The temperature is constant. If reservoir 1
is at the higher chemical potential, then particles will flow through the system
from reservoir 1 to reservoir 2. Particle flow in this direction will increase the
total entropy of reservoir 1 + reservoir 2 + system.

Consider particle diffusion, first when the difference of chemical potential is
caused by a difference in particle concentration. The flux density $J_n$ is the
number of particles passing through a unit area in unit time. The driving force
of isothermal diffusion is usually taken as the gradient of the particle
concentration along the system:

$$J_n = -D\,\mathrm{grad}\,n. \tag{19}$$

The relation is called Fick’s law; here $D$ is the particle diffusion constant or
diffusivity.

Particles travel freely over distances of the order of the mean free path $l$
before they collide. We assume that in a collision at position $z$ the particles
come into a local equilibrium condition at the local chemical potential $\mu(z)$ and
local concentration $n(z)$. Let $l_z$ be the z component of the mean free path. Across
the plane at $z$ there is a particle flux density in the positive z direction equal to
$\tfrac12n(z - l_z)\bar c_z$ and a flux density in the negative z direction equal to
$-\tfrac12n(z + l_z)\bar c_z$. Here $n(z - l_z)$ means the particle concentration at $z - l_z$. The net particle
flux density is the average over all directions on a hemisphere of

$$J_n^z = \tfrac12[n(z - l_z) - n(z + l_z)]\,\bar c_z = -\frac{dn}{dz}\,\bar c_zl_z. \tag{20}$$

We want to express the average value of $\bar c_zl_z$ in terms of $\bar cl$. Here $l_z = l\cos\theta$
is the projection of the mean free path, and $\bar c_z = \bar c\cos\theta$ is the projection of the
speed on the z axis. The average is taken over the surface of a hemisphere,
because all forward directions are equally likely. The element of surface area
is $2\pi\sin\theta\,d\theta$. Thus

$$\langle\bar c_zl_z\rangle = \bar cl\,\frac{2\pi\int_0^{\pi/2}\cos^2\theta\sin\theta\,d\theta}{2\pi} = \tfrac13\bar cl, \tag{21}$$

so that

$$J_n^z = -\tfrac13\bar cl\,\frac{dn}{dz}. \tag{22}$$

On comparison with (19) the diffusivity is given by

$$D = \tfrac13\bar cl. \tag{23}$$


Figure 14.5 Opposite ends of the system are in diffusive
contact with reservoirs at chemical potentials $\mu_1$ and $\mu_2$. The
temperature is constant everywhere.
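The hemisphere average that produces the factor of one third can be confirmed numerically, and the resulting diffusivity estimated. The following is a rough sketch: the helium mean speed and mean free path are assumed round values carried over from Table 14.1 and the standard-conditions estimate, and the function names are hypothetical.

```python
import math


def hemisphere_average_cos2(steps=100000):
    """Average of cos^2(theta) over a hemisphere, weighted by sin(theta) d(theta).

    Uses the midpoint rule on 0 <= theta <= pi/2.
    """
    d_theta = (math.pi / 2.0) / steps
    numerator = 0.0
    denominator = 0.0
    for i in range(steps):
        theta = (i + 0.5) * d_theta
        weight = math.sin(theta) * d_theta
        numerator += math.cos(theta) ** 2 * weight
        denominator += weight
    return numerator / denominator


def diffusivity(c_bar, mean_free_path):
    """D = (1/3) * c_bar * l, the elementary kinetic estimate."""
    return c_bar * mean_free_path / 3.0


avg = hemisphere_average_cos2()
D_he = diffusivity(1.2e5, 2.44e-5)   # assumed helium values, cm/s and cm
print(avg, D_he)                     # D in cm^2/s
```

The estimate of about 1 cm$^2$ s$^{-1}$ should be read as an order of magnitude only, since factors of order unity are neglected in this elementary treatment.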


The particle diffusion problem is the model for other transport problems. In 
particle diffusion we are concerned with the transport of particles; in thermal 
conductivity with the transport of energy by particles; in viscosity with the 
transport of momentum by particles; and in electrical conductivity with the 
transport of charge by particles. The linear transport coefficients that describe 
the processes are proportional to the particle diffusivity D. 

Let $\rho_A$ denote the concentration of the physical quantity $A$. If $A$ is a quantity
like charge or mass that has the same value for all the molecules, then the flux
density of $A$ in the z direction is

$$J_A^z = \rho_A\langle v_z\rangle, \tag{24}$$

where $\langle v_z\rangle$ is the mean drift velocity of the particles in the z direction. The drift
velocity is zero in thermal equilibrium.

If $A$ is a quantity like energy or momentum that depends on the velocity of
a molecule, then we always find a similar expression:

$$J_A^z = f_A\rho_A\langle v_z\rangle, \tag{25}$$

where $f_A$ is a factor with magnitude of the order of unity. The exact value of $f_A$
depends on the velocity dependence of $A$ and may be calculated by the method
of the Boltzmann transport equation treated at the end of this chapter. For
simplicity we set $f_A = 1$ in this discussion. By analogy with (19) for particle
diffusion, the phenomenological law for the transport of $A$ is

$$J_A = -D\,\mathrm{grad}\,\rho_A, \tag{26}$$

with the particle diffusivity $D$ given by (23).


Thermal Conductivity 


Fourier’s law

$$J_u = -K\,\mathrm{grad}\,\tau \tag{27}$$

describes the energy flux density $J_u$ in terms of the thermal conductivity $K$ and
the temperature gradient (Figure 14.6). This form assumes that there is a net
transport of energy, but not of particles. Another term must be added if
additional energy is transported by means of particle flow, as when electrons flow
under the influence of an electric field.

The energy flux density in the z direction is

$$J_u^z = \rho_u\langle v_z\rangle, \tag{28}$$

where $\langle v_z\rangle$ is the mean drift velocity; $\rho_u$ is the energy density. This result is
valid within a factor of the order of unity, as discussed. By analogy with the
diffusion equation, the right-hand side is equal to

$$-D\,d\rho_u/dz = -D(\partial\rho_u/\partial\tau)(d\tau/dz). \tag{29}$$


This describes the diffusion of energy. Now $\partial\rho_u/\partial\tau$ is just the heat capacity
per unit volume, denoted by $\hat C_V$. Thus

$$J_u = -D\hat C_V\,\mathrm{grad}\,\tau; \tag{30}$$

on comparison with (27) the thermal conductivity is

$$K = D\hat C_V = \tfrac13\hat C_V\bar cl. \tag{31}$$


Figure 14.6 Opposite ends of the system are in thermal
contact with reservoirs at temperatures $\tau_1$ and $\tau_2$.


The thermal conductivity of a gas is independent of pressure until at very low 
pressures the mean free path becomes limited by the dimensions of the appara- 
tus, rather than by intermolecular collisions. Until very low pressures are 
attained there is no advantage to evacuating a Dewar vessel, because the heat 
losses are independent of pressure as long as (31) applies. 
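The pressure independence follows because the heat capacity per unit volume is proportional to the concentration while the mean free path is inversely proportional to it. The sketch below illustrates this for a monatomic gas. It is an assumption-laden estimate: it uses conventional units (heat capacity per unit volume of $\tfrac32 n k_B$), the helium diameter and mean speed quoted earlier, and a hypothetical function name.

```python
import math

K_B = 1.380649e-16   # Boltzmann constant in erg/K (CGS, conventional units)


def thermal_conductivity(n_cm3, d_cm, c_bar):
    """K = (1/3) * C_V * c_bar * l for a monatomic gas, in erg s^-1 cm^-1 K^-1."""
    heat_capacity = 1.5 * n_cm3 * K_B                     # per unit volume
    mean_free_path = 1.0 / (n_cm3 * math.pi * d_cm ** 2)  # rigid-sphere estimate
    return heat_capacity * c_bar * mean_free_path / 3.0


D_HE = 2.2e-8     # helium diameter, cm
C_BAR = 1.2e5     # assumed helium mean speed at 273 K, cm/s
N_0 = 2.69e19     # Loschmidt number, cm^-3

K_1atm = thermal_conductivity(N_0, D_HE, C_BAR)
K_low = thermal_conductivity(N_0 * 1.0e-3, D_HE, C_BAR)
print(K_1atm, K_low)   # the two values coincide
```

The value, roughly 0.5 mW cm$^{-1}$ K$^{-1}$, is of the same order as measured conductivities of light gases; the elementary theory is not expected to do better than a factor of order unity.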


Viscosity 


Viscosity is a measure of the diffusion of momentum parallel to the flow velocity
and transverse to the gradient of the flow velocity. Consider a gas with flow
velocity in the x direction, with the flow velocity gradient in the z direction. The
viscosity coefficient $\eta$ is defined by

$$X_z = -\eta\frac{dv_x}{dz} = J_z(p_x). \tag{32}$$

Here $v_x$ is the x component of the flow velocity of the gas; $p_x$ denotes the x
component of momentum; and $X_z$ is the x component of the shear force exerted
by the gas on a unit area of the xy plane normal to the z direction. By Newton’s
second law of motion a shear stress $X_z$ acts on the xy plane if the plane receives
a net flux density of x momentum $J_z(p_x)$, because this flux density measures the
rate of change of the momentum of the plane, per unit area.

In diffusion the particle flux density in the z direction is given by the number
density $n$ times the mean drift velocity $\langle v_z\rangle$ in the z direction, so that
$J_n^z = n\langle v_z\rangle = -D\,dn/dz$. In the viscosity equation the transverse momentum density
is $nMv_x$; its flux density in the z direction is $(nMv_x)\langle v_z\rangle$. By analogy with (26)
this flux density equals $-D\,d(nMv_x)/dz$, within a factor of the order of unity.
With $\rho = nM$ as the mass density,

$$J_z(p_x) = \rho v_x\langle v_z\rangle = -D\rho\,dv_x/dz = -\eta\,dv_x/dz. \tag{33}$$


Thus, with $D$ given by (23),

$$\eta = D\rho = \tfrac13\rho\bar cl \tag{34}$$

gives the coefficient of viscosity. The CGS unit of viscosity is called the poise.
The mean free path is $l = 1/\pi d^2n$ from (13), where $d$ is the molecular diameter
and $n$ is the concentration. Thus the viscosity may be expressed as

$$\eta = M\bar c/3\pi d^2, \tag{35}$$


which is independent of the gas pressure. The independence fails at very high 
pressures when the molecules are nearly always in contact or at very low 
pressures when the mean free path is longer than the dimensions of the 
apparatus. 
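As an order-of-magnitude check, (35) can be evaluated for helium. The numbers below are assumptions for illustration: the atomic mass is the standard value, and the diameter and mean speed are the round values used earlier in the chapter. The function names are hypothetical.

```python
import math

AMU_G = 1.66053907e-24   # atomic mass unit in grams


def viscosity(mass_g, c_bar, d_cm):
    """Rigid-sphere estimate eta = M * c_bar / (3 pi d^2), in poise."""
    return mass_g * c_bar / (3.0 * math.pi * d_cm ** 2)


def viscosity_from_diffusivity(n_cm3, mass_g, c_bar, d_cm):
    """Same quantity computed as eta = D * rho, with D = c_bar * l / 3."""
    mean_free_path = 1.0 / (n_cm3 * math.pi * d_cm ** 2)
    return (c_bar * mean_free_path / 3.0) * (n_cm3 * mass_g)


M_HE = 4.0026 * AMU_G    # helium atom, g
C_BAR = 1.2e5            # assumed mean speed at 273 K, cm/s
D_HE = 2.2e-8            # atomic diameter, cm

eta = viscosity(M_HE, C_BAR, D_HE)
eta_check = viscosity_from_diffusivity(2.69e19, M_HE, C_BAR, D_HE)
print(f"eta = {eta * 1e6:.0f} micropoise")
```

The estimate, about 175 micropoise, is close to measured values for helium, and the concentration cancels between the diffusivity and the mass density, which is the pressure independence stated in the text.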

Robert Boyle in 1660 reported an early observation on the pressure inde- 
pendence of the damping of a pendulum in air: 


Experiment 26.... We observ'd also that when the Receiver was full of Air, 
the included Pendulum continu'd its Recursions about fifteen minutes (or a 
quarter of an hour) before it left off swinging; and that after the exsuction of 
the Air, the Vibration of the same Pendulum (being fresh put into motion)
appear’d not (by a minutes Watch) to last sensibly longer. So that the event of 
this experiment being other than we expected, scarce afforded us any other 
satisfaction, than that of our not having omitted to try it. 


Although at first glance implausible, this result is readily understood. With 
decreasing pressure the rate of momentum-transfer collisions decreases, but 
each colliding particle comes from farther away. The larger the distance, the 
larger the momentum difference; the increasing momentum transfer per collision 
cancels the decreasing collision rate. 

It is easier to measure the viscosity than the diffusivity. If $D = \eta/\rho$ as predicted
by (34), then $K$ is related to $\eta$ by

$$K = \eta\hat C_V/\rho. \tag{36}$$

The observed values of the ratio $K\rho/\eta\hat C_V$ given in Table 14.3 are somewhat
higher than the value unity predicted by our approximate calculations.
Improved calculations of the kinetic coefficients $K$, $D$, $\eta$ take account of minor,
but difficult, effects we have neglected; see the works cited in the general
references.


Table 14.3 Experimental values of $K$, $D$, $\eta$, and $K\rho/\eta\hat C_V$ at 0°C and 1 atm


Gas    K, in mW cm^-1 K^-1    D, in cm^2 s^-1    η, in μpoise    Kρ/ηĈ_V

He     1.50                   -                  186.            2.40
Ar     0.18                   0.158              210.            2.49
H2     1.82                   1.28               84.             1.91
N2     0.26                   -                  167.            1.91
O2     0.27                   -                  189.            1.90


Note: Values of the thermal conductivity are at 300 K.


Comment. The diffusivity of gas atoms is directly proportional to their viscosity. The
diffusivity of a particle suspended in a liquid or gas is a different problem: the viscosity of
the solvent opposes the diffusion of the suspended particle. We find $D \propto 1/\eta$, where $D$
refers to the particles and $\eta$ refers to the liquid. The Stokes-Einstein relation for suspended
particles is $D = \tau/6\pi\eta R$, where $R$ is the radius of the sphere in suspension.


Comment. The quantity $\nu = \eta/\rho$ is called the kinematic viscosity; if (34) holds, $\nu$ should
be equal to the diffusivity $D$. The ratio $\eta/\rho$ enters into hydrodynamic theory and into the
Reynolds number criterion for laminar flow.


Generalized Forces 


The transfer of entropy from one part of a system to another is a consequence
of any transport process. We can relate the rate of change of entropy to the
flux density of particles and of energy. By analogy with the thermodynamic
identity at constant volume,

$$d\sigma = \frac{1}{\tau}\,dU - \frac{\mu}{\tau}\,dN, \tag{37}$$

we write the entropy current density $J_\sigma$ as

$$J_\sigma = \frac{1}{\tau}J_u - \frac{\mu}{\tau}J_n. \tag{38}$$


Let $\hat\sigma$ denote the entropy density; let $\partial\hat\sigma/\partial t$ denote the net rate of change of
entropy density at a fixed position r. Then, by the equation of continuity,

$$\partial\hat\sigma/\partial t = g_\sigma - \mathrm{div}\,J_\sigma. \tag{39}$$

In a unit volume element the net rate of appearance of entropy is equal to the
rate of production $g_\sigma$ minus the loss $\mathrm{div}\,J_\sigma$ attributed to the transport current.

In a transfer process $U$ and $N$ are conserved. The equation of continuity for
the energy density $u$ is

$$\frac{\partial u}{\partial t} = -\mathrm{div}\,J_u; \tag{40}$$

the equation of continuity for the particle concentration $n$ is

$$\frac{\partial n}{\partial t} = -\mathrm{div}\,J_n. \tag{41}$$
ot 


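Let us take the divergence of $J_\sigma$ in (38):

$$\mathrm{div}\,J_\sigma = \frac{1}{\tau}\,\mathrm{div}\,J_u + J_u\cdot\mathrm{grad}(1/\tau) - (\mu/\tau)\,\mathrm{div}\,J_n - J_n\cdot\mathrm{grad}(\mu/\tau). \tag{42}$$

Let (37) refer to unit volume; we take a partial derivative with respect to time
to obtain the net rate of entropy change:

$$\frac{\partial\hat\sigma}{\partial t} = \frac{1}{\tau}\frac{\partial u}{\partial t} - \frac{\mu}{\tau}\frac{\partial n}{\partial t}. \tag{43}$$

We use (40)-(43) to rearrange (39) in a form suggestive of the ohmic power
dissipation:

$$g_\sigma = J_u\cdot\mathrm{grad}(1/\tau) + J_n\cdot\mathrm{grad}(-\mu/\tau), \tag{44}$$

or

$$g_\sigma = J_u\cdot F_u + J_n\cdot F_n. \tag{45}$$

Here $F_u$ and $F_n$ are generalized forces defined by

$$F_u = \mathrm{grad}(1/\tau); \qquad F_n = \mathrm{grad}(-\mu/\tau). \tag{46}$$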
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Einstein relation 


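In an isothermal process $\mathbf{F}_n$ in (46) may be written as $\mathbf{F}_n = (-1/\tau)\operatorname{grad}\mu$ or,
in terms of the internal and external parts of the chemical potential, as


$$ \mathbf{F}_n = -(1/\tau)\left[\operatorname{grad}\mu_{\rm int} + \operatorname{grad}\mu_{\rm ext}\right] . \tag{47} $$


For an ideal gas $\mu_{\rm int} = \tau\log(n/n_Q)$, so that $\operatorname{grad}\mu_{\rm int} = (\tau/n)\operatorname{grad}n$; for an
electrostatic potential $\varphi$, $\operatorname{grad}\mu_{\rm ext} = q\operatorname{grad}\varphi = -q\mathbf{E}$. Thus


$$ \mathbf{F}_n = -(1/\tau)\left[\tau n^{-1}\operatorname{grad}n - q\mathbf{E}\right] . \tag{48} $$


Now the particle flux density also has two terms, written as

$$ \mathbf{J}_n = -D_n\operatorname{grad}n + n\tilde\mu\mathbf{E} , \tag{49} $$


where $D_n$ is the diffusivity and $\tilde\mu$ is the mobility, which is the drift velocity per
unit electric field. The ratio of the coefficients of $\operatorname{grad}n$ to $\mathbf{E}$ is $D_n/n\tilde\mu$ in (49)
and $\tau/nq$ in (48). These ratios must be equal, so that


$$ D_n = \tau\tilde\mu/q , \tag{50} $$


which is called the Einstein relation between the diffusivity and the mobility
for a classical gas.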
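
As a rough numerical illustration of (50), the sketch below converts a mobility into a diffusivity in SI units, taking $\tau = k_B T$. The function name `einstein_diffusivity` and the sample mobility are assumptions made for illustration; they are not values from the text.

```python
# Minimal sketch of the Einstein relation D = tau * mobility / q, eq. (50).
# Assumption: SI units, with the fundamental temperature tau = k_B * T.

K_B = 1.380649e-23       # Boltzmann constant, J/K
Q_E = 1.602176634e-19    # elementary charge, C


def einstein_diffusivity(mobility, temperature_k, charge=Q_E):
    """Return the diffusivity in m^2/s for a mobility given in m^2 V^-1 s^-1."""
    tau = K_B * temperature_k          # fundamental temperature in joules
    return tau * mobility / charge


# Hypothetical carrier with mobility 0.1 m^2 V^-1 s^-1 at 300 K.
d_n = einstein_diffusivity(0.1, 300.0)
print(f"D_n = {d_n:.3e} m^2/s")
```

The ratio $\tau/q$ is the familiar thermal voltage, so the diffusivity is simply the mobility multiplied by about 26 mV at room temperature.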


Comment. We gain an advantage, for reasons related to the thermodynamics of irreversible
processes, if we use $\mathbf{F}_u$ and $\mathbf{F}_n$ in (46) as the driving forces for the linear transport
processes. We write


$$ \mathbf{J}_u = L_{11}\mathbf{F}_u + L_{12}\mathbf{F}_n ; \qquad \mathbf{J}_n = L_{21}\mathbf{F}_u + L_{22}\mathbf{F}_n . \tag{51} $$

The Onsager relation of irreversible thermodynamics is that

$$ L_{ij}(\mathbf{B}) = L_{ji}(-\mathbf{B}) , \tag{52} $$


where $\mathbf{B}$ is the magnetic field intensity. If $\mathbf{B} = 0$, then $L_{ij} = L_{ji}$ always. For (52) to hold,
the driving forces $\mathbf{F}$ must be defined as in (46). Other definitions of the forces are perfectly
valid, such as the pair $\operatorname{grad}\tau$ and $\operatorname{grad}n$, but do not necessarily lead to coefficients $L$ that
satisfy the Onsager relation. For a derivation see the book by Landau and Lifshitz cited
in the general references.


KINETICS OF DETAILED BALANCE 


Consider a system with two states, one at energy $\Delta$ and one at energy $-\Delta$. In
an ensemble of $N$ such systems, $N^+$ are at $\Delta$ and $N^-$ are at $-\Delta$, with
$N = N^+ + N^-$. To establish thermal equilibrium there must exist some mechanism
whereby systems can pass between the two states. Consider the rate equation
for transitions into and out of the upper state:


$$ dN^+/dt = \alpha N^- - \beta N^+ , \tag{53} $$


where $\alpha$, $\beta$ may be functions of the temperature. The transition rate from $-$ to
$+$ is directly proportional to the number of systems in the $-$ state. The transition
rate from $+$ to $-$ is directly proportional to the number of systems in the
$+$ state.

In thermal equilibrium $\langle dN^+/dt\rangle = 0$, which can be satisfied only if


$$ \alpha/\beta = \langle N^+\rangle/\langle N^-\rangle = \exp(-2\Delta/\tau) , \tag{54} $$


the Boltzmann factor. This result expresses a relation between $\alpha(\tau)$ and $\beta(\tau)$ that
must be satisfied by any and every mechanism that assists in the transitions. As
an example, suppose that the transition $+ \to -$ proceeds with the excitation
of a harmonic oscillator from a state of energy $s\varepsilon$ to a state of energy $(s + 1)\varepsilon$;
in the inverse process $- \to +$ the oscillator goes from $s\varepsilon$ to $(s - 1)\varepsilon$. In the
quantum mechanical theory of the oscillator it is shown that


$$ \frac{\beta}{\alpha} = \frac{\operatorname{Prob}(s \to s+1)}{\operatorname{Prob}(s \to s-1)} = \frac{s+1}{s} $$


for the excitation and de-excitation of the oscillator, a result derived in most
texts on quantum theory. The value of $\langle s\rangle$ is found from the Planck distribution:


$$ \langle s\rangle = \frac{1}{\exp(\varepsilon/\tau) - 1} ; \qquad \langle s + 1\rangle = \frac{\exp(\varepsilon/\tau)}{\exp(\varepsilon/\tau) - 1} , $$


so that, with $\varepsilon = 2\Delta$ to conserve energy,

$$ \alpha/\beta = \langle s\rangle/\langle s + 1\rangle = \exp(-2\Delta/\tau) . \tag{55} $$

This satisfies the condition (54).
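
The oscillator argument leading to (55) can be checked numerically. The sketch below is only an illustration: the function names are hypothetical, and energies are expressed in the same units as $\tau$.

```python
import math


def planck_occupancy(eps, tau):
    """Planck distribution <s> = 1 / (exp(eps/tau) - 1)."""
    return 1.0 / math.expm1(eps / tau)


def oscillator_rate_ratio(delta, tau):
    """alpha/beta = <s>/<s + 1> for an oscillator quantum eps = 2*delta."""
    s = planck_occupancy(2.0 * delta, tau)
    return s / (s + 1.0)


def boltzmann_ratio(delta, tau):
    """Equilibrium population ratio <N+>/<N-> = exp(-2*delta/tau), eq. (54)."""
    return math.exp(-2.0 * delta / tau)


# The two ratios should agree at every temperature.
for tau in (0.1, 1.0, 10.0):
    print(tau, oscillator_rate_ratio(1.0, tau), boltzmann_ratio(1.0, tau))
```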


The principle of detailed balance emerges as a generalization of this argument:
in thermal equilibrium the rate of any process that leads to a given state must
equal exactly the rate of the inverse process that leads from the state. One
common application of the principle is to the Kirchhoff law for the absorption
and emission of radiation by a solid, already discussed in Chapter 4: radiation
of a wavelength that is absorbed strongly by a solid will also be emitted
strongly; otherwise the specimen would heat up because it could not come into
thermal equilibrium with the radiation.


ADVANCED TREATMENT: 
BOLTZMANN TRANSPORT EQUATION 


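The classical theory of transport processes is based on the Boltzmann transport
equation. We work in the six-dimensional space of Cartesian coordinates $\mathbf{r}$ and
velocity $\mathbf{v}$. The classical distribution function $f(\mathbf{r},\mathbf{v})$ is defined by the relation


$$ f(\mathbf{r},\mathbf{v})\,d\mathbf{r}\,d\mathbf{v} = \text{number of particles in } d\mathbf{r}\,d\mathbf{v} . \tag{56} $$

The Boltzmann equation is derived by the following argument. We consider
the effect of a time displacement $dt$ on the distribution function. The Liouville
theorem of classical mechanics tells us that if we follow a volume element along
a flowline the distribution is conserved:


$$ f(t + dt, \mathbf{r} + d\mathbf{r}, \mathbf{v} + d\mathbf{v}) = f(t, \mathbf{r}, \mathbf{v}) , \tag{57} $$

in the absence of collisions. With collisions


$$ f(t + dt, \mathbf{r} + d\mathbf{r}, \mathbf{v} + d\mathbf{v}) - f(t, \mathbf{r}, \mathbf{v}) = dt\,(\partial f/\partial t)_{\rm collisions} . \tag{58} $$

Thus

$$ dt\,(\partial f/\partial t) + d\mathbf{r}\cdot\operatorname{grad}_{\mathbf{r}} f + d\mathbf{v}\cdot\operatorname{grad}_{\mathbf{v}} f = dt\,(\partial f/\partial t)_{\rm coll} . \tag{59} $$


Let $\boldsymbol\alpha$ denote the acceleration $d\mathbf{v}/dt$; then


$$ \partial f/\partial t + \mathbf{v}\cdot\operatorname{grad}_{\mathbf{r}} f + \boldsymbol\alpha\cdot\operatorname{grad}_{\mathbf{v}} f = (\partial f/\partial t)_{\rm coll} . \tag{60} $$


This is the Boltzmann transport equation.
In many problems the collision term $(\partial f/\partial t)_{\rm coll}$ may be treated by the introduction
of a relaxation time $\tau_c(\mathbf{r},\mathbf{v})$, defined by the equation


$$ (\partial f/\partial t)_{\rm coll} = -(f - f_0)/\tau_c . \tag{61} $$
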

Here $f_0$ is the distribution function in thermal equilibrium. Do not confuse $\tau_c$
for relaxation time with $\tau$ for temperature. Suppose that a nonequilibrium
distribution of velocities is set up by external forces which are suddenly removed.
The decay of the distribution towards equilibrium is then obtained from (61) as

$$ \frac{\partial (f - f_0)}{\partial t} = -\frac{f - f_0}{\tau_c} , \tag{62} $$


if we note that $\partial f_0/\partial t = 0$ by definition of the equilibrium distribution. This
equation has the solution


$$ (f - f_0)_t = (f - f_0)_{t=0}\exp(-t/\tau_c) . \tag{63} $$


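It is not excluded that $\tau_c$ may be a function of $\mathbf{r}$ and $\mathbf{v}$.
We combine (56), (60), and (61) to obtain the Boltzmann transport equation
in the relaxation time approximation:


$$ \frac{\partial f}{\partial t} + \boldsymbol\alpha\cdot\operatorname{grad}_{\mathbf{v}} f + \mathbf{v}\cdot\operatorname{grad}_{\mathbf{r}} f = -\frac{f - f_0}{\tau_c} . \tag{64} $$


In the steady state $\partial f/\partial t = 0$ by definition.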


Particle Diffusion 


Consider an isothermal system with a gradient of the particle concentration.
The steady-state Boltzmann transport equation in the relaxation time approximation
becomes


$$ v_x\,df/dx = -(f - f_0)/\tau_c , \tag{65} $$


where the nonequilibrium distribution function $f$ varies along the $x$ direction.
We may write (65) to first order as


$$ f_1 \simeq f_0 - v_x\tau_c\,df_0/dx , \tag{66} $$


where we have replaced $df/dx$ by $df_0/dx$. We can iterate to obtain higher order
solutions when desired. Thus the second order solution is


$$ f_2 = f_0 - v_x\tau_c\,df_1/dx = f_0 - v_x\tau_c\,df_0/dx + v_x^2\tau_c^2\,d^2f_0/dx^2 . \tag{67} $$


The iteration is necessary for the treatment of nonlinear effects.


Classical Distribution


Let $f_0$ be the distribution function in the classical limit:

$$ f_0 = \exp[(\mu - \varepsilon)/\tau] , \tag{68} $$


as in Chapter 6. We are at liberty to take whatever normalization for the
distribution function is most convenient because the transport equation is
linear in $f$ and $f_0$. We can take the normalization as in (68) rather than as in
(56). Then


$$ df_0/dx = (df_0/d\mu)(d\mu/dx) = (f_0/\tau)(d\mu/dx) , \tag{69} $$

and the first order solution (66) for the nonequilibrium distribution becomes

$$ f = f_0 - (v_x\tau_c f_0/\tau)(d\mu/dx) . \tag{70} $$
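The particle flux density in the $x$ direction is


$$ J_n^x = \int v_x f\,\mathcal{D}(\varepsilon)\,d\varepsilon , \tag{71} $$


where $\mathcal{D}(\varepsilon)$ is the density of orbitals per unit volume per unit energy range:


$$ \mathcal{D}(\varepsilon) = \frac{1}{4\pi^2}\left(\frac{2M}{\hbar^2}\right)^{3/2}\varepsilon^{1/2} , \tag{72} $$


as in (7.65) for a particle of spin zero. Thus

$$ J_n^x = \int v_x f_0\,\mathcal{D}(\varepsilon)\,d\varepsilon - (d\mu/dx)\int (v_x^2\tau_c f_0/\tau)\,\mathcal{D}(\varepsilon)\,d\varepsilon . \tag{73} $$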


The first integral vanishes because $v_x$ is an odd function and $f_0$ is an even function
of $v_x$. This confirms that the net particle flux vanishes for the equilibrium
distribution $f_0$. The second integral will not vanish.

Before evaluating the second integral, we have an opportunity to make use
of what we may know about the velocity dependence of the relaxation time $\tau_c$.
Only for the sake of example we assume that $\tau_c$ is constant, independent of
velocity; $\tau_c$ may then be taken out of the integral:


$$ J_n^x = -(d\mu/dx)(\tau_c/\tau)\int v_x^2 f_0\,\mathcal{D}(\varepsilon)\,d\varepsilon . \tag{74} $$


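The integral may be written as

$$ \frac{1}{3}\int v^2 f_0\,\mathcal{D}(\varepsilon)\,d\varepsilon = \frac{2}{3M}\int \left(\tfrac{1}{2}Mv^2\right) f_0\,\mathcal{D}(\varepsilon)\,d\varepsilon = n\tau/M , \tag{75} $$


because the integral is just the kinetic energy density $\tfrac{3}{2}n\tau$ of the particles. Here
$\int f_0\,\mathcal{D}(\varepsilon)\,d\varepsilon = n$ is the concentration. The particle flux density is


$$ J_n^x = -(n\tau_c/M)(d\mu/dx) = -(\tau_c\tau/M)(dn/dx) , \tag{76} $$


because $\mu = \tau\log n + \text{constant}$. The result (76) is of the form of the diffusion
equation with the diffusivity


$$ D = \tau_c\tau/M = \tfrac{1}{3}\langle v^2\rangle\tau_c . \tag{77} $$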


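Another possible assumption about the relaxation time is that it is inversely
proportional to the velocity, as in $\tau_c = l/v$, where the mean free path $l$ is constant.
Instead of (74) we have


$$ J_n^x = -(d\mu/dx)(l/\tau)\int (v_x^2/v) f_0\,\mathcal{D}(\varepsilon)\,d\varepsilon , \tag{78} $$

and now the integral may be written as

$$ \frac{1}{3}\int v f_0\,\mathcal{D}(\varepsilon)\,d\varepsilon = \tfrac{1}{3}n\bar{c} , \tag{79} $$


where $\bar{c}$ is the average speed. Thus


$$ J_n^x = -\tfrac{1}{3}(l\bar{c}n/\tau)(d\mu/dx) = -\tfrac{1}{3}l\bar{c}\,(dn/dx) , \tag{80} $$

and the diffusivity is

$$ D = \tfrac{1}{3}l\bar{c} . \tag{81} $$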
Fermi-Dirac Distribution 


The distribution function is


$$ f_0 = \frac{1}{\exp[(\varepsilon - \mu)/\tau] + 1} . \tag{82} $$

To form $df_0/dx$ as in (69) we need the derivative $df_0/d\mu$. We argue below that

$$ df_0/d\mu \simeq \delta(\varepsilon - \mu) , \tag{83} $$


at low temperatures $\tau \ll \mu$. Here $\delta$ is the Dirac delta function, which has the
property for a general function $F(\varepsilon)$ that


$$ \int_{-\infty}^{\infty} F(\varepsilon)\,\delta(\varepsilon - \mu)\,d\varepsilon = F(\mu) . \tag{84} $$
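Now consider the integral $\int_0^\infty F(\varepsilon)(df_0/d\mu)\,d\varepsilon$. At low temperatures $df_0/d\mu$ is very
large for $\varepsilon \approx \mu$ and is small elsewhere. Unless the function $F(\varepsilon)$ is very rapidly
varying near $\mu$ we may take $F(\varepsilon)$ outside the integral, with the value $F(\mu)$:


$$ \int_0^\infty F(\varepsilon)(df_0/d\mu)\,d\varepsilon \simeq F(\mu)\int_0^\infty (df_0/d\mu)\,d\varepsilon = -F(\mu)\int_0^\infty (df_0/d\varepsilon)\,d\varepsilon = -F(\mu)\left[f_0(\varepsilon)\right]_0^\infty = F(\mu)f_0(0) , \tag{85} $$


where we have used $df_0/d\mu = -df_0/d\varepsilon$. We have also used $f_0 = 0$ for $\varepsilon = \infty$.
At low temperatures $f_0(0) \approx 1$; thus the right-hand side of (85) is just $F(\mu)$,
consistent with the delta function approximation. Thus


$$ df_0/dx = \delta(\varepsilon - \mu)\,d\mu/dx . \tag{86} $$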
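The particle flux density is, from (71),

$$ J_n^x = -(d\mu/dx)\,\tau_c\int v_x^2\,\delta(\varepsilon - \mu)\,\mathcal{D}(\varepsilon)\,d\varepsilon , \tag{87} $$


where $\tau_c$ is the relaxation time at the surface $\varepsilon = \mu$ of the Fermi sphere. The
integral has the value


$$ \tfrac{1}{3}v_F^2\,(3n/2\varepsilon_F) = n/m , \tag{88} $$


by use of $\mathcal{D}(\mu) = 3n/2\varepsilon_F$ at absolute zero, from (7.17), where $\varepsilon_F = \tfrac{1}{2}mv_F^2$
defines the velocity $v_F$ on the Fermi surface. Thus


$$ J_n^x = -(n\tau_c/m)\,d\mu/dx . \tag{89} $$

At absolute zero $\mu(0) = (\hbar^2/2m)(3\pi^2 n)^{2/3}$, whence


$$ d\mu/dx = \left[\tfrac{2}{3}(\hbar^2/2m)(3\pi^2)^{2/3}/n^{1/3}\right]dn/dx = \tfrac{2}{3}(\varepsilon_F/n)\,dn/dx , \tag{90} $$
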


so that (87) becomes

$$ J_n^x = -(2\tau_c/3m)\,\varepsilon_F\,dn/dx = -\tfrac{1}{3}v_F^2\tau_c\,dn/dx . \tag{91} $$

The diffusivity is the coefficient of $dn/dx$:

$$ D = \tfrac{1}{3}v_F^2\tau_c , \tag{92} $$


closely similar in form to the result (77) for the classical distribution of velocities.
In (92) the relaxation time is to be taken at the Fermi energy.

We see we can solve transport problems where the Fermi-Dirac distribution 
applies, as in metals, as easily as where the classical approximation applies. 


Electrical Conductivity 


The isothermal electrical conductivity $\sigma$ follows from the result for the particle
diffusivity when we multiply the particle flux density by the particle charge $q$
and replace the gradient $d\mu/dx$ of the chemical potential by the gradient
$q\,d\varphi/dx = -qE_x$ of the external potential, where $E_x$ is the $x$ component of the
electric field intensity. The electric current density follows from (76):


$$ \mathbf{J}_q = (nq^2\tau_c/m)\mathbf{E} ; \qquad \sigma = nq^2\tau_c/m , \tag{93} $$


for a classical gas with relaxation time $\tau_c$. For the Fermi-Dirac distribution,
from (89),


$$ \mathbf{J}_q = (nq^2\tau_c/m)\mathbf{E} ; \qquad \sigma = nq^2\tau_c/m . \tag{94} $$


LAWS OF RAREFIED GASES 


Thus far in this chapter the discussion of transport has assumed that the
molecular mean free path is short in comparison with the dimensions of the
apparatus. At a gas pressure of $10^{-6}$ atm at room temperature, the mean free
path of a molecule is of the order of 25 cm. The diameter of a laboratory vacuum
system connection may be of the order of 25 cm, thus of the order of the mean
free path. We may usefully draw a line here and denote pressures lower than
$1 \times 10^{-6}$ atm as high vacuum. This pressure is approximately 0.1 N m$^{-2}$ or
$1 \times 10^{-6}$ kg cm$^{-2}$ or $7.6 \times 10^{-4}$ mm Hg or $7.6 \times 10^{-4}$ torr. The Knudsen
region of pressures is understood to be the region in which the mean free path
is much greater than the dimensions of the apparatus. A knowledge of the
behavior of gases in this pressure region is important in the use of high vacuum
pumps and allied equipment.

The terminology recommended by the American Vacuum Society is expressed
in terms of torr, where 1 torr = 1 mm Hg = $1.333 \times 10^{-3}$ bar = 133.3 N m$^{-2}$ =
1333 dyne cm$^{-2}$; here 1 bar = $10^{6}$ dyne cm$^{-2}$ = 0.987 standard atmospheres.
Then:


high vacuum          $10^{-3}$ to $10^{-6}$ torr


very high vacuum     $10^{-6}$ to $10^{-9}$ torr


ultra high vacuum    below $10^{-9}$ torr.


Flow of Molecules Through a Hole 


In the Knudsen regime we do not need to solve a hydrodynamic flow problem
in order to get the rate of efflux of gas molecules through a hole, because the
molecules do not see each other. We have merely to calculate the rate $J_n$ at
which molecules strike unit area of surface per unit time. We find for the flux
density


$$ J_n = \tfrac{1}{4}n\bar{c} , \tag{95} $$


where $n$ is the concentration and $\bar{c}$ is the mean speed of a gas molecule. To
prove (95), consider a unit cube containing $n$ molecules. Each molecule strikes
the $+z$ face of the cube $\tfrac{1}{2}\bar{c}_z$ times per unit time, so that in unit time $\tfrac{1}{2}n\bar{c}_z$ molecules
strike unit area.

We solve for $\bar{c}_z$ in terms of $\bar{c}$. Because $c_z = c\cos\theta$, we require the average of
$\cos\theta$ over a hemisphere:


$$ \langle\cos\theta\rangle = \frac{2\pi\int_0^{\pi/2}\cos\theta\,\sin\theta\,d\theta}{2\pi\int_0^{\pi/2}\sin\theta\,d\theta} = \frac{1}{2} . \tag{96} $$


Therefore $\bar{c}_z = \tfrac{1}{2}\bar{c}$, and (95) is obtained. The expression (95) for the flux forms
the basis for many calculations of gas flow in vacuum physics in the Knudsen
regime.
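
The hemisphere average in (96) is easy to confirm by direct numerical integration. The following is a minimal sketch using the midpoint rule; the function name and the number of steps are arbitrary choices for illustration.

```python
import math


def hemisphere_mean_cos(steps=10_000):
    """Midpoint-rule estimate of <cos(theta)> over a hemisphere, eq. (96)."""
    h = (math.pi / 2.0) / steps
    numerator = 0.0      # integral of cos(theta) sin(theta) d(theta)
    denominator = 0.0    # integral of sin(theta) d(theta)
    for i in range(steps):
        theta = (i + 0.5) * h
        numerator += math.cos(theta) * math.sin(theta) * h
        denominator += math.sin(theta) * h
    # The common factor 2*pi from the azimuthal integration cancels.
    return numerator / denominator


print(hemisphere_mean_cos())
```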

If $A$ is the area of the hole, the total particle flux, which is the number of
molecules per unit time, is


$$ \Phi = \tfrac{1}{4}An\bar{c} = nS , \tag{97} $$


where


$$ S = \tfrac{1}{4}A\bar{c} . \tag{98} $$


The conductance $S$ of the hole is defined as the volume of gas per unit time flowing
through the hole, with the volume taken at the actual pressure $p$ of the gas. The
conductance is usually expressed in liters per second. For the average air
molecule at $T = 300$ K we have $\bar{c} \approx 4.7 \times 10^{4}$ cm s$^{-1}$; for a circular hole of
10 cm diameter, (98) leads to a conductance of 917 liter/sec, roughly 1000
liter/sec.
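
The numbers in this example can be reproduced with a few lines of code. The sketch below is illustrative only: it assumes an average molecular mass of 29 atomic mass units for air (a value not stated in the text) and uses the mean speed formula (121) from the problems.

```python
import math

K_B = 1.380649e-23        # Boltzmann constant, J/K
AMU = 1.66053906660e-27   # atomic mass unit, kg


def mean_speed_cm_s(temperature_k, mass_amu):
    """Mean speed (8 k_B T / pi M)^(1/2), converted to cm/s."""
    mass = mass_amu * AMU
    return 100.0 * math.sqrt(8.0 * K_B * temperature_k / (math.pi * mass))


def hole_conductance_l_s(diameter_cm, speed_cm_s):
    """Conductance S = A * c / 4 of a circular hole, eq. (98), in liter/sec."""
    area = math.pi * (diameter_cm / 2.0) ** 2     # cm^2
    return 0.25 * area * speed_cm_s / 1000.0      # cm^3/s -> liter/s


c_bar = mean_speed_cm_s(300.0, 29.0)   # assumed mass of an "average air molecule"
print(f"mean speed  = {c_bar:.3e} cm/s")
print(f"conductance = {hole_conductance_l_s(10.0, c_bar):.0f} liter/sec")
```

With the rounded speed $4.7 \times 10^{4}$ cm s$^{-1}$ the same formula gives a value within about one percent of the 917 liter/sec quoted above; the small difference depends on the rounding of $\bar{c}$.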

For a hole with a given conductance the total particle flux is proportional to
the concentration $n$ or, because $p = n\tau$, to the pressure $p$:


$$ \Phi = nS = \frac{pS}{\tau} = \frac{Q}{\tau} . \tag{99} $$


Here we have defined the quantity

$$ Q \equiv pS , \tag{100} $$


sometimes called the throughput, which is widely used by vacuum physicists as
a convenient measure of the flow. The quantity $Q$ is numerically (not dimensionally)
equal to the gas volume flowing per unit time, but referred to the
volume at unit pressure, in whatever units are used to express pressure. Vacuum
physicists like to express pressure in torr, hence they usually express flux in
torr-liters per second. From the ideal gas law one finds that 1 torr-liter at
300 K is equivalent to $5.35 \times 10^{-5}$ mole or $3.22 \times 10^{19}$ molecules.
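
The conversion from torr-liters to molecules follows from the ideal gas law $pV = Nk_BT$. A minimal sketch is given below; the constant and function names are hypothetical and SI values of the constants are assumed.

```python
K_B = 1.380649e-23      # Boltzmann constant, J/K
N_A = 6.02214076e23     # Avogadro constant, 1/mol
TORR_PA = 133.322       # pascals per torr
LITER_M3 = 1.0e-3       # cubic meters per liter


def molecules_per_torr_liter(temperature_k):
    """Number of molecules in one torr-liter of ideal gas, N = pV / (k_B T)."""
    return TORR_PA * LITER_M3 / (K_B * temperature_k)


def moles_per_torr_liter(temperature_k):
    """Same quantity expressed in moles."""
    return molecules_per_torr_liter(temperature_k) / N_A


print(f"{molecules_per_torr_liter(300.0):.3e} molecules")
print(f"{moles_per_torr_liter(300.0):.3e} mole")
```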

Our calculations have expressed the flow of gas through a hole into a perfect
vacuum. With gas on both sides, the net flux from side 1 to side 2 will be


$$ \Delta\Phi = \tfrac{1}{4}A(n_1\bar{c}_1 - n_2\bar{c}_2) = \tfrac{1}{4}A\left(\frac{p_1\bar{c}_1}{\tau_1} - \frac{p_2\bar{c}_2}{\tau_2}\right) . \tag{101} $$


The condition for zero net flux is


$$ \frac{p_1}{p_2} = \frac{\tau_1\bar{c}_2}{\tau_2\bar{c}_1} = \left(\frac{\tau_1}{\tau_2}\right)^{1/2} , \tag{102} $$


using the proportionality of $\bar{c}$ to $\tau^{1/2}$. In the Knudsen regime equal pressures
do not imply zero net flux if the temperatures on the two sides are different.
At equal pressures gas will flow from the cold side to the hot side; zero gas
flow requires a higher pressure on the hot side.

If $\tau_1 = \tau_2$, Eq. (101) can be written

$$ \Delta\Phi = (n_1 - n_2)S = \frac{1}{\tau}(p_1 - p_2)S = \frac{1}{\tau}\Delta Q , \tag{103} $$


where

$$ \Delta Q = p_1S - p_2S = Q_1 - Q_2 . \tag{104} $$
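
The behavior described by (101) and (102) can be explored with a short sketch. All names are hypothetical; temperatures are in energy units, and the mean speed is taken from (121).

```python
import math


def mean_speed(tau, mass):
    """Mean speed (8 tau / pi M)^(1/2) with tau in energy units."""
    return math.sqrt(8.0 * tau / (math.pi * mass))


def net_flux(area, p1, tau1, p2, tau2, mass):
    """Net particle flux from side 1 to side 2 through a hole, eq. (101)."""
    c1 = mean_speed(tau1, mass)
    c2 = mean_speed(tau2, mass)
    return 0.25 * area * (p1 * c1 / tau1 - p2 * c2 / tau2)


def zero_flux_pressure_ratio(tau1, tau2):
    """Pressure ratio p1/p2 for which the net flux vanishes, eq. (102)."""
    return math.sqrt(tau1 / tau2)


# Side 1 cold (tau = 1), side 2 hot (tau = 4), arbitrary units.
print(net_flux(1.0, 1.0, 1.0, 1.0, 4.0, 1.0))    # equal pressures: flux is positive
print(zero_flux_pressure_ratio(1.0, 4.0))        # ratio p1/p2 for zero flux
```

With equal pressures the sketch gives a positive flux from the cold side 1 to the hot side 2, in agreement with the statement above.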


Example: Flow through a long tube. We assume that the molecules which strike the inner
wall of the tube are re-emitted in all directions; that is, the reflection at the surface is
assumed to be diffuse. Thus when there is a net flow there is a net momentum transfer to
the tube, and we must provide a pressure head to supply the momentum transfer. Let $u$ be
the velocity component of the gas molecules parallel to the wall before striking the wall.
We estimate the momentum transfer to the wall on the assumption that every collision
with the wall transfers momentum $M\langle u\rangle$. The rate of flow down the tube is $nA\langle u\rangle$, where
$A$ is the area of the opening. The rate at which molecules strike the wall is, from (95),


$$ \tfrac{1}{4}\pi L d\,n\bar{c} , \tag{105} $$


where $d$ is the diameter and $L$ the length of the tube. The momentum transfer to the tube
must equal the force due to the pressure differential $\Delta p$:


$$ \tfrac{1}{4}\pi L d\,n\bar{c}\,M\langle u\rangle = A\,\Delta p . \tag{106} $$


We solve for the flow velocity $\langle u\rangle$ to obtain, with $A = \tfrac{1}{4}\pi d^2$,

$$ \langle u\rangle = \frac{d\,\Delta p}{nM\bar{c}L} . \tag{107} $$


The net flux is


$$ \Delta\Phi = n\langle u\rangle A = \frac{Ad\,\Delta p}{M\bar{c}L} = \frac{\Delta p}{\tau}S , \tag{108} $$

where

$$ S = \tau\,\Delta\Phi/\Delta p = \frac{\tau}{M}\frac{Ad}{\bar{c}L} \tag{109} $$


is the conductance of the tube, defined analogously to the conductance of a hole, Eq. (97).


A more detailed calculation, with averages over the velocity distribution taken more
carefully, leads to a conductance differing from (109) by a factor $8/3\pi$:


$$ S = \frac{8\tau Ad}{3\pi M\bar{c}L} = \frac{2\tau d^3}{3M\bar{c}L} . \tag{110} $$


The conductance of a tube cannot be larger than that of a hole with the same area. From
(98), (110), and (121) below,


$$ S_{\rm tube}/S_{\rm hole} = \frac{32}{3\pi}\frac{\tau d}{M\bar{c}^2L} = \frac{4d}{3L} . \tag{111} $$


This ratio will be larger than unity for $3L < 4d$. In writing (106) we assumed implicitly
that every molecule hits the tube wall. This will not be true for a short tube. For our result
to be valid we must suppose that the tube is long enough to make the ratio (111) be small
compared to unity, which means


$$ L \gg \tfrac{4}{3}d . \tag{112} $$


Using our earlier example for the conductance of a hole, we find that the conductance of
a tube 1 meter long and 10 cm in diameter is about 122 liter/sec, for air at 300 K.
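
The tube example can be checked in the same way as the hole example. The sketch below rewrites (110) with $\tau/M = \pi\bar{c}^2/8$ from (121), so that $S_{\rm tube} = \pi\bar{c}d^3/12L$; the function names are hypothetical.

```python
import math


def hole_conductance_l_s(diameter_cm, speed_cm_s):
    """S_hole = A * c / 4 = pi d^2 c / 16, eq. (98), in liter/sec."""
    return math.pi * diameter_cm ** 2 * speed_cm_s / 16.0 / 1000.0


def tube_conductance_l_s(diameter_cm, length_cm, speed_cm_s):
    """S_tube = 2 tau d^3 / (3 M c L) = pi c d^3 / (12 L), eq. (110), in liter/sec."""
    return math.pi * speed_cm_s * diameter_cm ** 3 / (12.0 * length_cm) / 1000.0


c_bar = 4.7e4   # cm/s, the rounded mean speed quoted for air at 300 K
s_tube = tube_conductance_l_s(10.0, 100.0, c_bar)
s_hole = hole_conductance_l_s(10.0, c_bar)
print(f"S_tube = {s_tube:.0f} liter/sec, S_tube/S_hole = {s_tube / s_hole:.4f}")
```

The computed conductance is close to the 122 liter/sec quoted in the example, and the ratio reproduces $4d/3L$ of (111).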


Speed of a Pump 


The speed of a pump is defined similarly to the conductance of a hole or of a tube;
it is defined as the volume pumped per unit time, with the volume taken at the
intake pressure of the pump. The same symbol $S$ is used as for conductance;
sometimes the conductance of an aperture or a tube is referred to as its speed.
The product $Q = pS$ for a pump is often called the throughput of the pump.

If a pump of speed $S_p$ evacuates a vacuum system through a tube of conductance
$S_t$, the effective pumping speed $S_{\rm eff}$ of the combination is given by


$$ \frac{1}{S_{\rm eff}} = \frac{1}{S_p} + \frac{1}{S_t} , \tag{113} $$


just as for the conductance of two electrical conductors in series.


Proof: Let $p_1$ denote the pressure at the input end of the tube, and let $p_2$
denote the pump intake pressure at the output end of the tube. Continuity of
flux requires that


$$ p_1S_{\rm eff} = (p_1 - p_2)S_t = p_2S_p , \tag{114} $$


so that


$$ \frac{1}{S_{\rm eff}} = \frac{1}{S_p}\frac{p_1}{p_2} = \frac{1}{S_p}\frac{S_t + S_p}{S_t} , \tag{115} $$


equivalent to (113).

The relation (113) for $S_{\rm eff}$ explains why in high vacuum systems the connections
between the pump and the vessel to be evacuated must be as short and of
as large a diameter as possible. A long and narrow connecting tube makes poor 
use of a high speed pump. Further, the speed of the pump itself cannot be larger 
than the conductance of its own aperture. 
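
The series rule (113) is simple to evaluate. The sketch below uses hypothetical pump and tube values to illustrate how a narrow connection limits a fast pump; the function name is an assumption.

```python
def effective_speed(pump_speed, tube_conductance):
    """Effective pumping speed from 1/S_eff = 1/S_p + 1/S_t, eq. (113)."""
    return 1.0 / (1.0 / pump_speed + 1.0 / tube_conductance)


# Hypothetical 1000 liter/sec pump behind a tube of conductance 122 liter/sec.
print(f"S_eff = {effective_speed(1000.0, 122.0):.1f} liter/sec")
```

The effective speed can never exceed the smaller of the two quantities, which is the quantitative content of the remark about long and narrow connecting tubes.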

How rapidly does a pump with effective speed $S$ evacuate a volume $V$? From
the ideal gas law $pV = N\tau$, and from the definition of pump speed analogous
to Eq. (99) we find


$$ \frac{dp}{dt} = \frac{\tau}{V}\frac{dN}{dt} = -\frac{Q}{V} = -\frac{pS}{V} . \tag{116} $$


If the pump speed is independent of pressure, this differential equation has the
solution


$$ p(t) = p(0)\exp(-t/t_0) ; \qquad t_0 = V/S . \tag{117} $$


For a volume of 100 liters connected to a pump with a speed of 100 liter/sec, the 
pressure should decrease by 1/e per second. 
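
A short sketch of the ideal pumpdown law (117) follows. It assumes a pressure-independent pump speed and ignores surface desorption; the function names are hypothetical.

```python
import math


def pressure(t, p0, volume, speed):
    """p(t) = p(0) exp(-t / t0) with t0 = V / S, eq. (117)."""
    return p0 * math.exp(-t * speed / volume)


def time_to_pressure(p_target, p0, volume, speed):
    """Time needed to pump from p0 down to p_target under eq. (117)."""
    return (volume / speed) * math.log(p0 / p_target)


# 100 liter volume, 100 liter/sec pump: time constant t0 = 1 s.
print(pressure(1.0, 1.0, 100.0, 100.0))           # pressure ratio after one second
print(time_to_pressure(1e-3, 1.0, 100.0, 100.0))  # seconds for a factor of 1000
```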

Any user of vacuum technology soon discovers that the pumpdown of a 
vacuum system proceeds much more slowly in the high and ultrahigh vacuum 
regions than expected on the basis of pumping speed and system volume. The 
desorption of surface gas predominates—often by many orders of magnitude— 
over volume gas. The surface emits adsorbed molecules as fast as the pump 
evacuates molecules from the volume. 


SUMMARY

1. The probability that an atom has velocity in $dv$ at $v$ is

$$ P(v)\,dv = 4\pi(M/2\pi\tau)^{3/2}v^2\exp(-Mv^2/2\tau)\,dv , $$

the Maxwell velocity distribution.


2. Diffusion is described by

$$ \mathbf{J}_n = -D\operatorname{grad}n ; \qquad D = \tfrac{1}{3}\bar{c}l , $$

where $\bar{c}$ is the mean speed and $l$ is the mean free path.


3. Thermal conductivity is described by

$$ \mathbf{J}_u = -K\operatorname{grad}\tau ; \qquad K = \tfrac{1}{3}C_V\bar{c}l , $$

where $C_V$ refers to unit volume.


4. The coefficient of viscosity is given by

$$ \eta = \tfrac{1}{3}\rho\bar{c}l , $$

where $\rho$ is the mass density.


5. According to the principle of detailed balance, in thermal equilibrium the
rate of any process that leads to a given state must equal exactly the rate of the
inverse process that leads from the state.


6. The Boltzmann transport equation in the relaxation time approximation is

$$ \frac{\partial f}{\partial t} + \boldsymbol\alpha\cdot\operatorname{grad}_{\mathbf{v}} f + \mathbf{v}\cdot\operatorname{grad}_{\mathbf{r}} f = -\frac{f - f_0}{\tau_c} . $$


7. The electrical conductivity of a Fermi gas is

$$ \sigma = nq^2\tau_c/m , $$

where $\tau_c$ is the relaxation time.


PROBLEMS 


1. Mean speeds in a Maxwellian distribution. (a) Show that the root mean
square velocity $v_{\rm rms}$ is


$$ v_{\rm rms} = \langle v^2\rangle^{1/2} = (3\tau/M)^{1/2} . \tag{118} $$


Because $\langle v^2\rangle = \langle v_x^2\rangle + \langle v_y^2\rangle + \langle v_z^2\rangle$ and $\langle v_x^2\rangle = \langle v_y^2\rangle = \langle v_z^2\rangle$, it follows
that


$$ \langle v_x^2\rangle^{1/2} = (\tau/M)^{1/2} = v_{\rm rms}/3^{1/2} . \tag{119} $$

The results can also be obtained directly from the expression in Chapter 3 for
the average kinetic energy of an ideal gas. (b) Show that the most probable
value of the speed $v_{\rm mp}$ is


$$ v_{\rm mp} = (2\tau/M)^{1/2} . \tag{120} $$


By most probable value of the speed we mean the maximum of the Maxwell
distribution as a function of $v$. Notice that $v_{\rm mp} < v_{\rm rms}$. (c) Show that the mean
speed $\bar{c}$ is


$$ \bar{c} = \int_0^\infty dv\,vP(v) = (8\tau/\pi M)^{1/2} . \tag{121} $$

The mean speed may also be written as $\langle|v|\rangle$. The ratio

$$ v_{\rm rms}/\bar{c} = 1.086 . \tag{122} $$


(d) Show that $\bar{c}_z$, the mean of the absolute value of the $z$ component of the
velocity of an atom, is


$$ \bar{c}_z = \langle|v_z|\rangle = \tfrac{1}{2}\bar{c} = (2\tau/\pi M)^{1/2} . \tag{123} $$


2. Mean kinetic energy in a beam. (a) Find the mean kinetic energy in a beam
of molecules that exits from a small hole in an oven at temperature $\tau$. (b) Assume
now that the molecules are collimated by a second hole farther down the beam, 
so that the molecules that pass through the second hole have only a small 
velocity component normal to the axis of emission. What is the mean kinetic 
energy? Comment: The molecules in the beam do not collide and are not in real 
thermal equilibrium after they have exited from the oven. The gas left in the 
oven is depleted with respect to fast molecules, and the residual gas will cool 
down if it is not reheated by heat flowing in through the walls of the oven. 


3. Ratio of thermal to electrical conductivity. Show for a classical gas of
particles of charge $q$ that


$$ K/\tau\sigma = 3/2q^2 , \quad \text{or} \quad K/T\sigma = 3k_B^2/2q^2 \tag{124} $$

in conventional units for $K$ and $T$. This is known as the Wiedemann-Franz ratio.


4. Thermal conductivity of metals. The thermal conductivity of copper at
room temperature is largely carried by the conduction electrons, one per atom.
The mean free path of the electrons at 300 K is of the order of $400 \times 10^{-8}$ cm.
The conduction electron concentration is $8 \times 10^{22}$ per cm$^3$. Estimate (a) the
electron contribution to the heat capacity; (b) the electronic contribution to the
thermal conductivity; (c) the electrical conductivity. Specify units.


5. Boltzmann equation and thermal conductivity. Consider a medium with
temperature gradient $d\tau/dx$. The particle concentration is constant. (a) Employ
the Boltzmann transport equation in the relaxation time approximation to
find the first order nonequilibrium classical distribution:


$$ f = f_0 - v_x\tau_c\left(-\frac{3}{2\tau} + \frac{\varepsilon}{\tau^2}\right)f_0\frac{d\tau}{dx} . \tag{125} $$


(b) Show that the energy flux in the $x$ direction is


$$ J_u^x = -\frac{d\tau}{dx}\int v_x^2\tau_c\left(-\frac{3}{2\tau} + \frac{\varepsilon}{\tau^2}\right)\varepsilon f_0\,\mathcal{D}(\varepsilon)\,d\varepsilon , \tag{126} $$


where $v_x^2 = 2\varepsilon/3m$. (c) Evaluate the integral to obtain for the thermal conductivity
$K = 5n\tau\tau_c/m$.


6. Flow through a tube. Show that when a liquid flows through a narrow tube
under a pressure difference $p$ between the ends, the total volume flowing through
the tube in unit time is


$$ \dot{V} = (\pi a^4/8\eta L)\,p , \tag{127} $$


where $\eta$ is the viscosity; $L$ is the length; $a$ is the radius. Assume that the flow is
laminar and that the flow velocity at the walls of the tube is zero.


7. Speed of a tube. Show that for air at 20°C the speed of a tube in liters per
second is given by, approximately,

$$ \frac{12\,d^3}{L + \tfrac{4}{3}d} , \tag{128} $$


where the length $L$ and diameter $d$ are in centimeters; we have tried to correct
for end effects on a tube of finite length by treating the ends as two halves of a
hole in series with the tube.


Chapter 15: Propagation


The purpose of this terminal chapter is to bring within the compass of the text 
the most important problems in the propagation of heat and the propagation 
of sound, both classical subjects that are part of an education in thermal physics. 


HEAT CONDUCTION EQUATION 


Consider first the derivation of the diffusion equation, which is found from the
Fick law (14.19) for the particle flux density:


$$ \mathbf{J}_n = -D_n\operatorname{grad}n , \tag{1} $$


where $D_n$ is the particle diffusivity and $n$ the particle concentration. The equation
of continuity,


$$ \frac{\partial n}{\partial t} + \operatorname{div}\mathbf{J}_n = 0 , \tag{2} $$

assures that the number of particles is conserved. Because $\operatorname{div}\operatorname{grad} \equiv \nabla^2$,
substitution of (1) in (2) gives


$$ \frac{\partial n}{\partial t} = D_n\nabla^2 n . \tag{3} $$


This partial differential equation describes the time-dependent diffusion of the
particle concentration $n$.

The thermal conductivity equation is derived similarly. By (14.27-14.30) we
have in a homogeneous medium


$$ \mathbf{J}_u = -K\operatorname{grad}\tau . \tag{4} $$

The equation of continuity for the energy density is


$$ C\frac{\partial\tau}{\partial t} + \operatorname{div}\mathbf{J}_u = 0 , \tag{5} $$


where $C$ is the heat capacity per unit volume. We combine (4) and (5) to obtain
the heat conduction equation


$$ \frac{\partial\tau}{\partial t} = D_\tau\nabla^2\tau ; \qquad D_\tau \equiv K/C . \tag{6} $$


This equation describes the time-dependent diffusion of the temperature. The
equation is of the form of the particle diffusion equation (3). The quantity $D_\tau$
is called the thermal diffusivity; for a gas it is approximately equal to the particle
diffusivity, as in (14.23).


Comment. The eddy current equation of electromagnetic theory* has the same form as
(3) and (6). If $\mathbf{B}$ is the magnetic field intensity, then


$$ \frac{\partial\mathbf{B}}{\partial t} = D_B\nabla^2\mathbf{B} . \tag{7} $$


The constant $D_B$ may be called the magnetic diffusivity and in SI is equal to $1/\sigma\mu$; in CGS,
$D_B = c^2/4\pi\sigma\mu$. It has the dimensions (length)$^2$ (time)$^{-1}$ and is directly proportional to the
(skin depth)$^2$ times the frequency. When we have solved one equation, say (3), we have
solved three problems.


* See, for example, W. R. Smythe, Static and dynamic electricity, McGraw-Hill, 3rd ed., 1968, p. 369.
This book has an unusually full treatment of eddy current problems.


Dispersion Relation, $\omega$ Versus $k$

We look for solutions of the diffusivity equation

$$ D\nabla^2\theta = \partial\theta/\partial t \tag{8} $$

that have the wavelike form

$$ \theta = \theta_0\exp[i(\mathbf{k}\cdot\mathbf{r} - \omega t)] , \tag{9} $$


with $\omega$ as the angular frequency and $\mathbf{k}$ as the wavevector. Plane wave analysis is
an excellent approach to this problem, even though it will turn out that the
diffusion waves or heat waves are so highly damped that they are hardly waves
at all. Substitute (9) in (8) to obtain the relation between $k$ and $\omega$:

$$ Dk^2 = i\omega . \tag{10} $$

A relation $\omega(k)$ for a plane wave is called a dispersion relation.
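
Because (10) makes $k$ complex for real $\omega$, the wave is damped as it propagates. The sketch below solves (10) for $k$ with complex arithmetic and shows that the real and imaginary parts are equal, both being the reciprocal of the length $(2D/\omega)^{1/2}$. The function names and the sample values are illustrative assumptions.

```python
import cmath
import math


def diffusion_wavevector(omega, diffusivity):
    """Root of D k^2 = i omega with positive imaginary part (decay for z > 0)."""
    return cmath.sqrt(1j * omega / diffusivity)


def damping_length(omega, diffusivity):
    """Length (2D/omega)^(1/2) over which the amplitude falls by 1/e."""
    return math.sqrt(2.0 * diffusivity / omega)


k = diffusion_wavevector(2.0, 0.5)      # arbitrary units
print(k, 1.0 / damping_length(2.0, 0.5))
```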


Penetration of Temperature Oscillation

Consider the variation of temperature in the semi-infinite medium $z > 0$ when
the temperature of the plane $z = 0$ is varied periodically with time as


$$ \theta(0,t) = \theta_0\cos\omega t , \tag{11} $$


which is the real part of $\theta_0\exp(-i\omega t)$, for real $\theta_0$. Then in the medium $z > 0$
the temperature is


$$ \theta(z,t) = \theta_0\operatorname{Re}\{\exp[i(kz - \omega t)]\} = \theta_0\operatorname{Re}\{\exp[i^{3/2}(\omega/D)^{1/2}z - i\omega t]\} , \tag{12} $$


where Re denotes real part and $i^{3/2} = (i - 1)/\sqrt{2}$. Thus, with $\delta \equiv (2D/\omega)^{1/2}$,


$$ \theta(z,t) = \theta_0\operatorname{Re}\{\exp(-z/\delta)\exp[i(z/\delta) - i\omega t]\} = \theta_0\exp(-z/\delta)\cos(\omega t - z/\delta) . \tag{13} $$


The quantity $\delta \equiv (2D/\omega)^{1/2}$ has the dimensions of a length and represents the
characteristic penetration depth of the temperature variation: at this depth the
amplitude of the oscillations of $\theta$ is reduced by $e^{-1}$. The characteristic depth is
called the skin depth if we are dealing with the eddy current equation. The
wave is highly damped in the medium: the wave amplitude decreases by $e^{-1}$
in a distance equal to a wavelength/$2\pi$.

If the thermal diffusivity of soil is taken as $D \approx 1 \times 10^{-3}$ cm$^2$ s$^{-1}$, then the
penetration depth of the diurnal cycle of heating of the ground by the sun and
cooling of the ground by the night sky ($\omega = 0.73 \times 10^{-4}$ s$^{-1}$) is


$$ \delta(\text{diurnal}) = (2D/\omega)^{1/2} \approx 5\ \text{cm} . $$

For the annual cycle,


$$ \delta(\text{annual}) \approx 1\ \text{m} . $$
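
These two estimates follow directly from the penetration depth $(2D/\omega)^{1/2}$. A minimal sketch, using the soil diffusivity assumed above and hypothetical names, is:

```python
import math


def penetration_depth_cm(diffusivity_cm2_s, period_s):
    """Penetration depth (2D/omega)^(1/2) for a periodic surface temperature."""
    omega = 2.0 * math.pi / period_s
    return math.sqrt(2.0 * diffusivity_cm2_s / omega)


D_SOIL = 1.0e-3                      # cm^2/s, the value assumed in the text
DAY = 86400.0                        # seconds
YEAR = 365.25 * DAY                  # seconds

print(f"diurnal: {penetration_depth_cm(D_SOIL, DAY):.1f} cm")
print(f"annual:  {penetration_depth_cm(D_SOIL, YEAR) / 100.0:.2f} m")
```

The annual depth exceeds the diurnal depth by the factor $(365)^{1/2} \approx 19$, because the depth scales as the square root of the period.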


A layer of 10cm of earth on top of a cellar will tend to average out day/night 
variations of surface temperature, but the summer/winter variation at the top 
of the cellar requires several meters of earth. Actual values of the thermal 
diffusivity are sensitive to the composition and condition of the soil or rock. 
Notice that a figure of merit for cellar construction involves the thermal 
diffusivity, and not the conductivity alone. 


Development of a Pulse 


In addition to the wavelike solutions of the form (9), the diffusion equation has
several other useful forms of solutions. We confirm by insertion in (8) that


$$ \theta(x,t) = (4\pi Dt)^{-1/2}\exp(-x^2/4Dt) \tag{14} $$


is a solution. The proportionality factor has been chosen so that

$$ \int_{-\infty}^{+\infty}\theta(x,t)\,dx = 1 . \tag{15} $$


The solution (14) corresponds to the time development of a pulse which at $t = 0$
has the form of a Dirac delta function $\delta(x)$, sharply localized at $x = 0$, and zero
elsewhere.

The pulse might be a temperature pulse, as when a pulsed laser or pulsed
electron beam heats a surface briefly. Let $Q$ be the quantity of heat deposited on
the surface, per unit area. The temperature distribution is then given by


$$ \theta(x,t) = (2Q/C_V)(4\pi Dt)^{-1/2}\exp(-x^2/4Dt) , \tag{16} $$


where $C_V$ is the heat capacity per unit volume of the material. The function is
plotted in Figure 15.1. The factor 2 arises because all heat is assumed to flow
inwards from the surface, while for the solution (14) symmetrical flow was
assumed. Another example of the application of (14) is the diffusion of impurities
deposited on the surface of a semiconductor, to form a p-n junction inside the
semiconductor.

The pulse spreads out with increasing time. The mean square value of $x$ is
given by


$$ \langle x^2\rangle = \int x^2\theta(x,t)\,dx \Big/ \int\theta(x,t)\,dx = 2Dt , \tag{17} $$

after evaluating the Gaussian integrals. The root mean square value is


$$ x_{\rm rms}(t) = \langle x^2\rangle^{1/2} = (2Dt)^{1/2} . \tag{18} $$

Figure 15.1 Plot of spread of temperature pulse with time, for $4\pi D = 1$,
from Eq. (16). At $t = 0$ the pulse is a delta function.


Comment. This result shows that the width of the distribution increases as $t^{1/2}$, which is
a general characteristic of diffusion and random walk problems in one dimension. It is
quite unlike the motion of a wave pulse in a nondispersive medium, which is a medium
for which $\omega = vk$, where $v$ is the constant velocity. The connection with Brownian motion
or the random walk problem follows if we let $t_0$ be the duration of each step of a random
walk; then $t = Nt_0$, where $N$ is the number of steps. It follows that


$$ x_{\rm rms}(t) = (2Dt_0)^{1/2}N^{1/2} , \tag{19} $$

so that the rms displacement is proportional to the square root of the number of steps.
This is the result observed in studies of the Brownian motion, the random motion of
suspensions of small particles in liquids.
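
The square-root law (19) can be illustrated with a small simulation of one-dimensional random walks. This is only a sketch: the step length, the number of walkers, and the seed are arbitrary choices, and the function name is hypothetical.

```python
import math
import random


def rms_displacement(n_steps, n_walkers=2000, step=1.0, seed=0):
    """Estimate the rms displacement of a 1-D random walk with equal +/- steps."""
    rng = random.Random(seed)           # fixed seed so the run is reproducible
    total_sq = 0.0
    for _ in range(n_walkers):
        x = 0.0
        for _ in range(n_steps):
            x += step if rng.random() < 0.5 else -step
        total_sq += x * x
    return math.sqrt(total_sq / n_walkers)


for n in (25, 100, 400):
    # For unit steps the expected rms displacement is sqrt(n).
    print(n, rms_displacement(n), math.sqrt(n))
```

For unit steps the mean square displacement after $N$ steps is $N$, which corresponds to $2Dt_0 = (\text{step length})^2$ in (19).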


Diffusion with a Fixed Boundary Condition at x = 0 


If a solution of (8) is differentiated or integrated with respect to any of its 
independent variables, the result may again be a solution. An important example 
is obtained by integrating (14) with respect to x: 


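$$\theta(x,t) = (4\pi Dt)^{-1/2}\int_0^x dx'\,\exp(-x'^2/4Dt)
  = \frac{1}{\sqrt{\pi}}\int_0^u ds\,\exp(-s^2) = \tfrac{1}{2}\,\mathrm{erf}\,u , \tag{20}$$

where $u = x/(4Dt)^{1/2}$. Here we have introduced the error function defined by

$$\mathrm{erf}\,u = \frac{2}{\sqrt{\pi}}\int_0^u ds\,\exp(-s^2) . \tag{21}$$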


Tables of the error function are readily available. The error function has the 
properties 


$$\mathrm{erf}(0) = 0 ; \qquad \lim_{x\to\infty}\mathrm{erf}(x) = 1 . \tag{22}$$

Of particular practical interest is the diffusion of heat or of particles into an
infinite solid from a surface at $x = 0$, with the fixed boundary condition $\theta = \theta_0$
at $x = 0$ and $\theta = 0$ at $x = \infty$. (For $t < 0$ we assume $\theta = 0$ everywhere.) The
solution is

$$\theta(x,t) = \theta_0[1 - \mathrm{erf}(x/(4Dt)^{1/2})] . \tag{23}$$

Again we see that the distance at which $\theta(x,t)$ reaches a specified value is
proportional to $(4Dt)^{1/2}$. The application of this solution to the diffusion of impurities
into a semiconductor is discussed in a problem.
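
The scaling with $(4Dt)^{1/2}$ can be seen numerically. The following is a minimal sketch, assuming an illustrative diffusivity and a target value $\theta/\theta_0 = 0.5$; the function names `profile` and `depth_at_fraction` are hypothetical. It evaluates Eq. (23) with the complementary error function and locates the depth of the chosen value by bisection.

```python
from math import erfc, sqrt

def profile(x, t, D, theta0=1.0):
    """theta(x,t) = theta0 * [1 - erf(x / sqrt(4 D t))], as in Eq. (23)."""
    return theta0 * erfc(x / sqrt(4.0 * D * t))

def depth_at_fraction(fraction, t, D):
    """Depth at which theta/theta0 equals `fraction`, found by bisection."""
    lo, hi = 0.0, 10.0 * sqrt(4.0 * D * t)   # erfc(10) is negligible, so the root is bracketed
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if profile(mid, t, D) > fraction:     # profile decreases with depth
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

D = 1.0e-3   # illustrative diffusivity in cm^2/s (assumed value)
for t in (1.0, 4.0, 16.0):
    x_half = depth_at_fraction(0.5, t, D)
    # the last column is the same for every t: depth scales as (4 D t)**0.5
    print(t, x_half, x_half / sqrt(4.0 * D * t))
```

Quadrupling the time doubles the depth, whatever value of $\theta/\theta_0$ is chosen.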


Time-Independent Distribution 
Let us look at a solution of (8) that is independent of the time. The diffusivity 
equation reduces to the Laplace equation 

$$\nabla^2\theta = 0 . \tag{24}$$


Consider a semi-infinite medium bounded by the plane z = 0 and extending 
along the positive z axis. Let the temperature vary sinusoidally in the boundary 
plane: 


$$\theta(x,y,0) = \theta_0\sin kx . \tag{25}$$

The solution of (24) in the medium is

$$\theta(x,y,z) = \theta_0\sin kx\,\exp(-kz) . \tag{26}$$


The temperature variation is damped exponentially with the distance from the 
boundary plane. The temperature distribution in the time-independent problem 
must be maintained by constant heat sources on the boundary plane $z = 0$.
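
That the damped sinusoid satisfies the Laplace equation can be verified with a finite-difference estimate. This is a sketch only: the wavevector $k = 2$, the amplitude, the step `h`, and the sample points are arbitrary choices, and the helper `laplacian` is a hypothetical name. The $y$ derivative is omitted because the solution does not depend on $y$.

```python
from math import exp, sin

def theta(x, z, k=2.0, theta0=1.0):
    """theta(x, y, z) = theta0 * sin(k x) * exp(-k z); independent of y."""
    return theta0 * sin(k * x) * exp(-k * z)

def laplacian(f, x, z, h=1e-3):
    """Central-difference estimate of d2f/dx2 + d2f/dz2."""
    return (f(x + h, z) + f(x - h, z) + f(x, z + h) + f(x, z - h) - 4.0 * f(x, z)) / h**2

# The Laplacian is zero to within the discretization error, while theta itself is not
for x, z in [(0.3, 0.1), (1.0, 0.5), (2.0, 1.5)]:
    print(x, z, theta(x, z), laplacian(theta, x, z))
```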


PROPAGATION OF SOUND WAVES IN GASES 


Results developed earlier in this book can be applied to the study of sound waves 
in gases. Thermal effects are important in this problem. Let $\delta p(x,t)$ denote the
pressure associated with the sound wave; the form of the wave may be written
as

$$\delta p = \delta p_0\exp[i(kx - \omega t)] , \tag{27}$$

where $k$ is the wavevector and $\omega$ is the angular frequency. The wave propagates
in the $x$ direction.
We suppose the equation of state is that of an ideal gas: 


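$$pV = N\tau , \quad\text{or}\quad p = \rho\tau/M , \tag{28}$$

where $\rho = NM/V$ is the mass density, and $M$ is the mass of a molecule. The
force equation referred to unit volume is

$$\rho\frac{\partial u}{\partial t} = -\frac{\partial p}{\partial x}
  = -\frac{\tau}{M}\frac{\partial\rho}{\partial x} - \frac{\rho}{M}\frac{\partial\tau}{\partial x} . \tag{29}$$

Here $u$ is the $x$ component of the velocity of a volume element. The motion is
subject to the equation of continuity

$$\partial\rho/\partial t + \mathrm{div}(\rho\mathbf{v}) = 0 , \tag{30}$$

or, in one dimension,

$$\partial\rho/\partial t + \partial(\rho u)/\partial x = 0 . \tag{31}$$

The thermodynamic identity is

$$dU + p\,dV = \tau\,d\sigma , \tag{32a}$$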




which can also be written 


$$\frac{1}{V}\,dU + \frac{p}{V}\,dV = \frac{\tau}{V}\,d\sigma . \tag{32b}$$


If we assume (pending discussion below) that there is no entropy exchange 
during the passage of a sound wave, Eq. (32a) becomes 


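$$C_V(\partial\tau/\partial t) + (p/V)(\partial V/\partial t) = 0 , \tag{33}$$

where $C_V$ is the heat capacity at constant volume, per unit volume. We can
rewrite the second term in terms of $\partial\rho/\partial t$ because $\rho = NM/V$ and
$(1/V)(\partial V/\partial t) = -(1/\rho)(\partial\rho/\partial t)$. Now the thermodynamic identity appears as

$$C_V(\partial\tau/\partial t) - (p/\rho)(\partial\rho/\partial t) = 0 . \tag{34}$$

Let us define the fractional deviations $s$, $\theta$ by

$$\rho = \rho_0(1 + s) ; \qquad \tau = \tau_0(1 + \theta) , \tag{35}$$

where $\rho_0$, $\tau_0$ are the density and temperature in the absence of the sound wave.
We assume that $u$, $s$, $\theta$ have the form of a traveling wave: $\exp[i(kx - \omega t)]$. The
three equations (29), (31), (34) that govern the motion now become

$$-i\omega\rho u + ik[(\tau\rho_0/M)s + (\rho\tau_0/M)\theta] = 0 ; \tag{36}$$

$$-i\omega\rho_0 s + ik(\rho_0 us + \rho u) = 0 ; \tag{37}$$

$$-i\omega\tau_0 C_V\theta + i\omega(p/\rho)\rho_0 s = 0 . \tag{38}$$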


We assume that at sufficiently small wave amplitudes it is a good approximation
to neglect in these equations terms in the squares and cross products of
$u$, $s$, and $\theta$. For example, $\rho u = \rho_0(1 + s)u$ becomes $\rho_0 u$ if the cross product $su$ is
neglected. The equations thus reduce to, with the subscripts dropped from
$\rho$ and $\tau$,

$$\omega u - (k\tau/M)s - (k\tau/M)\theta = 0 ; \tag{39}$$

$$\omega s - ku = 0 ; \tag{40}$$

$$\tau C_V\theta - ps = 0 ; \quad\text{or}\quad C_V\theta - ns = 0 , \tag{41}$$

where $n$ is the concentration. These equations have a solution only if

$$\omega = (\gamma\tau/M)^{1/2}k , \tag{42}$$

where $\gamma = (C_V + n)/C_V = C_p/C_V$ in our units. The velocity of sound is

$$v_s = \partial\omega/\partial k = (\gamma\tau/M)^{1/2} . \tag{43}$$


This result applies to monatomic gases from the lowest frequencies up to high 
frequencies limited only by the requirement that the acoustic wavelength should 
be much larger than the mean free path of the atoms. This requirement is the 
criterion for the applicability of the hydrodynamic approach embodied in the 
force equation (29). 
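
As a numerical illustration of Eq. (43), the sketch below evaluates $v_s$ for argon and for air. The values of $\gamma$ and of the temperature are those of Table 15.1; the molecular masses, the SI value of the Boltzmann constant, and the function name `sound_velocity` are assumptions introduced here and do not come from the text.

```python
from math import sqrt

K_B = 1.380649e-23     # Boltzmann constant in J/K (SI value, assumed here)
AMU = 1.66053907e-27   # atomic mass unit in kg

def sound_velocity(gamma, temperature_K, mass_amu):
    """v_s = (gamma * tau / M)**0.5 with tau = k_B * T, following Eq. (43)."""
    tau = K_B * temperature_K
    return sqrt(gamma * tau / (mass_amu * AMU))

# gamma and temperature from Table 15.1; masses are standard values (assumed)
print("Ar :", sound_velocity(1.668, 288.15, 39.948), "m/s")
print("air:", sound_velocity(1.403, 290.15, 28.97), "m/s")
```

Both results lie in the range of a few hundred metres per second, with air the faster of the two because of its smaller mean molecular mass.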


Thermal Relaxation 


With polyatomic gases (43) is valid at low frequencies, but as the frequency is 
increased there is a transition frequency region above which the velocity of 
sound increases. The transition region between low frequency and high fre- 
quency propagation is associated with relaxation effects. 

Thermal relaxation describes the establishment of thermal equilibrium in a 
system. Energy dissipation results when all parts of a system are not at the same 
temperature; the dissipation is strongest when the period of the heating and 
cooling half-cycle in the sound wave is comparable with the time required for 
heat exchange between the different degrees of freedom of the system. In 
polyatomic gases under standard conditions there are time delays of the order 
of 10~*s in the transfer of energy between the internal vibrational states of a 
molecule and the external translation states. 

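Let the heat capacity $C_i$ and temperature $\tau_i = \tau_0(1 + \theta_i)$ refer to the internal
states, while $C_V$ and $\tau = \tau_0(1 + \theta)$ refer to the translational states. Then (34)
becomes

$$C_i(\partial\tau_i/\partial t) + C_V(\partial\tau/\partial t) - (p/\rho)(\partial\rho/\partial t) = 0 , \tag{44}$$

or, in place of (38),

$$-i\omega\tau_0 C_i\theta_i - i\omega\tau_0 C_V\theta + i\omega(p/\rho)\rho_0 s = 0 . \tag{45}$$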


Suppose that the transfer of energy between the internal and external states has
the characteristic time delay $t_0$ such that

$$\partial\tau_i/\partial t = -(\tau_i - \tau)/t_0 , \tag{46}$$




or 
$$-i\omega\theta_i = (\theta - \theta_i)/t_0 . \tag{47}$$


Here $t_0$ is called the relaxation time. There will be separate relaxation times for
the rotational-translational transfer and the vibrational-translational transfer. 
We combine (39), (40), (45) and (47) to obtain the dispersion relation

$$k^2 = \omega^2(M/\tau)\,\frac{C_V + C_i - i\omega t_0 C_V}{C_p + C_i - i\omega t_0 C_p} , \tag{48}$$

where $C_V$, $C_p$ refer to the translational states alone. In the low frequency limit
$\omega t_0 \ll 1$ and

$$k^2 = \omega^2(M/\tau)\,\frac{C_V + C_i}{C_p + C_i} = \omega^2(M/\gamma_0\tau) , \tag{49}$$

where $\gamma_0$ is the low frequency limit of the total heat capacity ratio
$(C_p + C_i)/(C_V + C_i)$. The low frequency limit of the velocity of sound is

$$v_s(0) = (\gamma_0\tau/M)^{1/2} . \tag{50}$$

In the high frequency limit $\omega t_0 \gg 1$ and

$$k^2 = \omega^2(M/\tau)(C_V/C_p) = \omega^2(M/\gamma_\infty\tau) . \tag{51}$$

Here $\gamma_\infty$ refers only to the translational states: at high frequencies the internal
states are not excited by the sound wave. The high frequency limit of the velocity
of sound is

$$v_s(\infty) = (\gamma_\infty\tau/M)^{1/2} . \tag{52}$$

Values of $\gamma_0$ are given in Table 15.1; if no internal states at all are excited,
$\gamma_0 = \gamma_\infty$.
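
The passage from the low frequency to the high frequency velocity can be followed numerically. The sketch below assumes a dispersion relation of the form (48), $k^2 = \omega^2(M/\tau)(C_V + C_i - i\omega t_0 C_V)/(C_p + C_i - i\omega t_0 C_p)$, and evaluates the phase velocity $\omega/\mathrm{Re}\,k$. The heat capacities are hypothetical illustrative values per molecule in units with $n = 1$, so that $C_p = C_V + 1$; the function name `velocity_ratio` is also an assumption.

```python
import cmath

def velocity_ratio(omega_t0, c_v=1.5, c_p=2.5, c_i=1.0):
    """Phase velocity omega/Re(k) in units of (tau/M)**0.5.

    c_v, c_p: translational heat capacities; c_i: internal heat capacity.
    All three are illustrative values, not data for a particular gas.
    """
    num = c_v + c_i - 1j * omega_t0 * c_v
    den = c_p + c_i - 1j * omega_t0 * c_p
    k_over_omega = cmath.sqrt(num / den)   # k/omega in units of (M/tau)**0.5
    return 1.0 / k_over_omega.real

# The squared velocity rises from gamma_0 = 3.5/2.5 to gamma_inf = 2.5/1.5
for x in (1e-3, 0.1, 1.0, 10.0, 1e3):
    print(x, velocity_ratio(x) ** 2)
```

The transition occurs where $\omega t_0$ is of order unity, which is the frequency region associated with the relaxation effects.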

The wave is attenuated when $k$ is complex; the imaginary part of $k$ gives the
pressure attenuation coefficient $\alpha$. From (48) it is found that the maximum
absorption per wavelength occurs when $\omega = 2\pi/t_0$ and is given approximately
by

$$(\alpha\lambda)_{\max} \simeq \frac{\pi}{2}\,\frac{(C_p - C_V)\,C_i}{(C_V + C_i)\,C_p} . \tag{53}$$

Table 15.1 Ratio $C_p/C_V = \gamma$ for gases

Gas      Temperature, °C      γ

Air            17           1.403
H₂O           100           1.324
H₂             15           1.410
O₂           −181           1.450
               15           1.401
              200           1.396
             2000           1.303
CO₂            15           1.304
Ar             15           1.668
He           −180           1.660

Note: For a monatomic ideal gas, $C_p/C_V = 5/3 = 1.667$, as
for Ar and He. For a diatomic gas at a temperature high
enough to excite the rotational motion, $C_p/C_V = 7/5 = 1.40$,
as for O₂ and H₂ at room temperature; at temperatures
sufficiently high to excite also the vibrational motion,
$C_p/C_V = 9/7 = 1.286$, as for O₂ at 2000°C. The values given
are of $\gamma_0$, applicable to static processes and to sound waves
in the limit of low frequencies. For very high frequency
sound waves only the translational motion is excited and
$\gamma = 5/3$ is applicable.


For CO₂ gas at the relaxation frequency of 20 kHz under standard conditions
the intensity is observed to decrease by 1/e in about 4 wavelengths, a massive
absorption, in agreement with theory.


Example: Heat transfer in a sound wave. Equation (33) expresses the isentropic
assumption: the equation neglects the thermal conductivity which gives rise to some transfer of
thermal energy within the sound wave between successive warm and cool half cycles. The
assumption that $d\sigma = 0$ must be modified to take account of heat flow. The heat conduction
equation (6) may be written as

$$K\,\partial^2\tau/\partial x^2 = \tau\,\partial\hat\sigma/\partial t , \tag{54}$$

where $\hat\sigma$ is the entropy density. Then (34) becomes

$$C_V(\partial\tau/\partial t) - (p/\rho)(\partial\rho/\partial t) = K(\partial^2\tau/\partial x^2) ,$$

or

$$-i\omega\tau C_V\theta + i\omega ps = -K\tau k^2\theta . \tag{55}$$


When we use this in place of (41), the dispersion relation $\omega(k)$ becomes

$$k^2 = \omega^2(M/\tau)\,\frac{C_V + iWk^2}{C_p + iWk^2} , \tag{56}$$

with $W = K/\omega$. At low frequencies $Wk^2$ is much smaller than $C_V$, so that the sound velocity
is equal to the isentropic result $v_s = (\gamma_0\tau/M)^{1/2}$, as before. The condition $Wk^2 \ll C_V$ is
essentially the condition $l \ll \lambda$, where $l$ is the molecular mean free path and $\lambda$ is the
wavelength of the sound wave. The attenuation of the pressure oscillation is given by the
imaginary part of the wavevector $k$ and is denoted by $\alpha$. The result from (56) in the low frequency
region $Wk^2 \ll C_V$ is that

$$\alpha = (\gamma_0 - 1)K\omega^2/2v_s^3C_p , \tag{57}$$

where $C_p$ refers to unit volume.


SUMMARY 


1. The heat conduction equation is the partial differential equation that follows
when the phenomenological transport equation (here the Fourier law) is
combined with the equation of continuity. We obtain

$$\frac{\partial\tau}{\partial t} = D_\tau\nabla^2\tau ; \qquad D_\tau = K/\hat{C} ,$$

where $\hat{C}$ is the heat capacity per unit volume.


2. The time-dependent diffusion equation and the eddy current equation have
the same form, so that their solutions may be translated from the solutions
of the heat conduction equation, these being often more familiar in the
literature.


3. Frequently it is useful to construct solutions in the form of superpositions 
of plane waves of the form 


$$\theta = \theta_0\exp[i(\mathbf{k}\cdot\mathbf{r} - \omega t)] .$$


The differential equation then gives the relation between $\omega$ and $k$, called the
dispersion relation of the problem.


4. The propagation of sound waves in gases depends on the rate of exchange of
energy between the translational, rotational, and vibrational motions of a
molecule. A low frequency sound wave is described by isentropic, and not
isothermal, parameters, a result that seems paradoxical at first sight.


PROBLEMS


1. Fourier analysis of pulse. Consider a distribution that at the initial time
$t = 0$ has the form of a Dirac delta function $\delta(x)$. A delta function can be
represented by a Fourier integral:

$$\theta(x,0) = \delta(x) = \frac{1}{2\pi}\int_{-\infty}^{\infty} dk\,\exp(ikx) . \tag{58}$$

At later times the pulse becomes

$$\theta(x,t) = \frac{1}{2\pi}\int_{-\infty}^{\infty} dk\,\exp[i(kx - \omega t)] , \tag{59}$$

or, by use of (10),

$$\theta(x,t) = \frac{1}{2\pi}\int_{-\infty}^{\infty} dk\,\exp(ikx - Dk^2t) . \tag{60}$$

Evaluate the integral to obtain the result (14). The method can be extended to
describe the time development of any distribution given at $t = 0$. If the
distribution is $f(x,0)$, then by the definition of the delta function

$$f(x,0) = \int dx'\,f(x',0)\,\delta(x - x') . \tag{61}$$

The time development of $\delta(x - x')$ is

$$\theta(x - x',t) = (4\pi Dt)^{-1/2}\exp[-(x - x')^2/4Dt] , \tag{62}$$

by (14). Thus at time $t$ the distribution $f(x,0)$ has evolved to

$$f(x,t) = (4\pi Dt)^{-1/2}\int dx'\,f(x',0)\exp[-(x - x')^2/4Dt] . \tag{63}$$


This is a powerful general solution. 
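
The use of the general solution (63) may be illustrated on a grid. The sketch below is an assumption-laden example rather than a prescribed method: the initial box-shaped distribution, the grid, the values of $D$ and $t$, and the function name `evolve` are all hypothetical. The integral over $x'$ is replaced by a sum over grid points.

```python
from math import exp, pi, sqrt

def evolve(f0, xs, t, D):
    """Apply Eq. (63) on a uniform grid: Gaussian-kernel convolution of f(x,0)."""
    dx = xs[1] - xs[0]
    norm = dx / sqrt(4.0 * pi * D * t)
    return [norm * sum(f * exp(-(x - xp) ** 2 / (4.0 * D * t)) for f, xp in zip(f0, xs))
            for x in xs]

xs = [-10.0 + 0.05 * i for i in range(401)]        # grid from -10 to 10
f0 = [1.0 if abs(x) <= 1.0 else 0.0 for x in xs]   # hypothetical box-shaped start
f1 = evolve(f0, xs, t=0.5, D=1.0)

dx = xs[1] - xs[0]
print(sum(f0) * dx, sum(f1) * dx)   # the total amount is conserved
print(max(f0), max(f1))             # the peak drops as the distribution spreads
```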


2. Diffusion in two and three dimensions. (a) Show that the diffusion equation
in two dimensions admits the solution

$$\theta_2(t) = (C_2/t)\exp(-r^2/4Dt) \tag{64}$$

and in three dimensions

$$\theta_3(t) = (C_3/t^{3/2})\exp(-r^2/4Dt) . \tag{65}$$

(b) Evaluate the constants $C_2$ and $C_3$. These solutions are analogous to (14)
and describe the evolution of a delta function at $t = 0$.


3. Temperature variations in soil. Consider a hypothetical climate in which
both the daily and the annual variations of the temperature are purely sinusoidal,
with amplitudes $\theta_1 = 10$°C. The mean annual temperature $\theta_0 = 10$°C. Take
the thermal diffusivity of the soil to be $1\times10^{-3}$ cm² s⁻¹. What is the minimum
depth at which water pipes should be buried in this climate?


4. Cooling of a slab. Suppose a hot slab of thickness $2a$ and initial uniform
temperature $\theta_1$ is suddenly immersed into water of temperature $\theta_0 < \theta_1$,
thereby reducing the temperature at the surface of the slab abruptly to $\theta_0$ and
keeping it there. Expand the temperature in the slab in a Fourier series. After
some time all but the longest wavelength Fourier component of the temperature
will have decayed, and then the temperature distribution becomes sinusoidal.
After what time will the temperature difference between the center of the slab
and its surface decay to 0.01 of the initial difference $\theta_1 - \theta_0$?


5. p-n junction: diffusion from a fixed surface concentration. Suppose a silicon
crystal is p-type doped with a concentration of $n_a = 10^{16}$ cm⁻³ of boron
atoms. If the crystal slab is heated in an atmosphere containing phosphorus
atoms, the latter will diffuse as donors with a concentration $n_d(x)$ into the
semiconductor. They will form a p-n junction at that depth at which $n_d = n_a$.
Assume that the diffusion conditions are such that the phosphorus concentration
at the surface is maintained at $n_d(0) = 10^{17}$ cm⁻³. Take the diffusion
coefficient of donors to be $D = 10^{-12}$ cm² s⁻¹. What is the value of the constant
$C$ in the equation $x = Ct^{1/2}$, where $x$ is the depth of the p-n junction and $t$ is
the time?


6. Heat diffusion with internal sources. When internal heat sources are present,
the continuity equation (5) must be modified to read

$$\hat{C}\,\frac{\partial\tau}{\partial t} + \mathrm{div}\,\mathbf{J}_u = g_u , \tag{66}$$

where $g_u$ is the heat generation rate per unit volume. Examples include Joule
heat generated in a wire and heat from the radioactive decay of trace elements
inside the Earth or the Moon. Give an expression for the temperature rise at the
center of (a) a cylindrical wire and (b) the spherical Earth, on the assumption
that $g_u$ is independent of position and is constant with time.


7. Critical size of nuclear reactor. Extend the considerations of the preceding
problem to particle diffusion, and assume that there is a net particle generation
rate $g_n$ that is proportional to the local particle concentration, $g_n = n/t_0$, where
$t_0$ is a characteristic time constant. Such behavior describes the neutron
generation in a nuclear reactor. The value of $t_0$ depends on the concentration of $^{235}$U
nuclei; if no surface losses took place, the neutron concentration would grow
as $\exp(t/t_0)$. Consider a reactor in the shape of a cube of volume $L^3$ and assume
that surface losses pin the neutron surface concentration at zero. Show that
Eq. (3), if augmented by a generation term $g_n = n/t_0$, has solutions of the form

$$n(x,y,z,t) \propto \exp(t/t_1)\cos(k_xx)\cos(k_yy)\cos(k_zz) , \tag{67}$$

where $k_xL$, $k_yL$ and $k_zL$ are integer multiples of $\pi$. Give the functional
dependence of the net time constant $t_1$ on $k_x$, $k_y$, $k_z$ and $t_0$, and show that for at least
one of the solutions of the form (67) the neutron concentration grows with time
if $L$ exceeds a critical value $L_{\rm crit}$. Express $L_{\rm crit}$ as a function of $D_n$ and $t_0$. In
actual nuclear reactors this increase is ultimately halted because the neutron
generation rate $g_n$ decreases with increasing temperature.


Some Integrals Containing Exponentials


THE GAUSS INTEGRAL 
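Let

$$I_0 = \int_{-\infty}^{\infty}\exp(-x^2)\,dx = 2\int_0^{\infty}\exp(-x^2)\,dx . \tag{1}$$

The following trick is used to evaluate $I_0$. Write (1) in terms of a different
integration variable:

$$I_0 = \int_{-\infty}^{\infty}\exp(-y^2)\,dy . \tag{2}$$

Multiply (1) and (2) and convert the result to a double integral:

$$I_0^2 = \int_{-\infty}^{\infty}\exp(-x^2)\,dx\int_{-\infty}^{\infty}\exp(-y^2)\,dy
  = \int_{-\infty}^{\infty}\int_{-\infty}^{\infty}\exp[-(x^2 + y^2)]\,dx\,dy . \tag{3}$$

This is an integral over the entire x-y plane. Convert to polar coordinates $r$ and
$\varphi$, as shown in Figure A.1. Then $x^2 + y^2 = r^2$, and the area element $dA =
dx\,dy$ becomes $dA = r\,dr\,d\varphi$:

$$I_0^2 = \int_0^{\infty}\int_0^{2\pi}\exp(-r^2)\,r\,dr\,d\varphi = 2\pi\int_0^{\infty}\exp(-r^2)\,r\,dr .$$

Because $d[\exp(-r^2)] = -2\exp(-r^2)\,r\,dr$, the integral over $r$ is elementary:

$$I_0^2 = -\pi\int_{r=0}^{\infty} d[\exp(-r^2)] = -\pi\bigl[\exp(-r^2)\bigr]_{r=0}^{\infty} = \pi .$$

Thus

$$I_0 = \int_{-\infty}^{\infty}\exp(-x^2)\,dx = 2\int_0^{\infty}\exp(-x^2)\,dx = \pi^{1/2} . \tag{4}$$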

Figure A.1 The area element $dA = r\,dr\,d\varphi$.




GENERALIZED GAUSS INTEGRALS, 
AND GAMMA FUNCTION INTEGRALS 


Integrals of the form

$$I_m = 2\int_0^{\infty} x^m\exp(-x^2)\,dx , \qquad (m > -1) , \tag{5}$$

where $m$ need not be an integer, may be reduced to the widely tabulated gamma
function $\Gamma(z)$, by the substitutions $x^2 = y$, $2\,dx = y^{-1/2}dy$:

$$I_m = \int_0^{\infty} y^n e^{-y}\,dy = \Gamma(n + 1) , \qquad n = (m - 1)/2 . \tag{6}$$

The integral in (6) may be viewed as the definition of $\Gamma(z)$ for noninteger positive
values of $z$.
The gamma function satisfies the recursion relation

$$\Gamma(n + 1) = n\,\Gamma(n) . \tag{7}$$

It is easily obtained for $n > 0$ from (6) by integration by parts, and it is used to
extend the definition (6) of $\Gamma(z)$ to negative values of $z$. By using (7) repeatedly
it is always possible to reduce $\Gamma(z)$ for arbitrary argument to a value in the
interval $0 < z \le 1$.


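For $m = 0$, $n = -\tfrac12$; from (4):

$$I_0 = \int_0^{\infty} y^{-1/2}e^{-y}\,dy = \Gamma(\tfrac12) = \pi^{1/2} . \tag{8}$$

If $m$ is an even integer, $m = 2l > 0$, $n$ is a half-integer, and $n = l - \tfrac12$, then we find
by repeated application of (7), with the aid of (8), that

$$I_{2l} = 2\int_0^{\infty} x^{2l}\exp(-x^2)\,dx = \int_0^{\infty} y^{l-1/2}e^{-y}\,dy
  = \Gamma(l + \tfrac12) = (l - \tfrac12)\times(l - \tfrac32)\times\cdots\times\tfrac32\times\tfrac12\times\pi^{1/2} . \tag{9}$$

For $m = 1$, $n = 0$:

$$I_1 = 2\int_0^{\infty} x\exp(-x^2)\,dx = \int_0^{\infty} e^{-y}\,dy = \Gamma(1) = 1 . \tag{10}$$

If $m$ is an odd integer, $m = 2l + 1 > 1$, $n$ is an integer, $n = l > 0$, and we find
similarly, with the aid of (10),

$$I_{2l+1} = 2\int_0^{\infty} x^{2l+1}\exp(-x^2)\,dx = \int_0^{\infty} y^{l}e^{-y}\,dy
  = \Gamma(l + 1) = l\times(l - 1)\times\cdots\times2\times1 = l! . \tag{11}$$

The gamma function for positive integer argument is simply the factorial of
the integer preceding the argument.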
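
The reduction of the integrals $I_m$ to the gamma function can be checked numerically. The sketch below assumes a simple midpoint rule with an arbitrary cutoff at $x = 12$ (the function name `gauss_moment` and its parameters are hypothetical); it compares the quadrature with $\Gamma((m + 1)/2)$, which is $\Gamma(n + 1)$ with $n = (m - 1)/2$.

```python
from math import exp, factorial, gamma, pi, sqrt

def gauss_moment(m, upper=12.0, steps=20_000):
    """Midpoint-rule estimate of I_m = 2 * integral_0^inf x**m * exp(-x**2) dx."""
    h = upper / steps
    total = 0.0
    for i in range(steps):
        x = (i + 0.5) * h            # midpoint of the i-th interval
        total += x ** m * exp(-x * x)
    return 2.0 * h * total

# Quadrature (second column) against Gamma((m + 1)/2) (third column)
for m in range(6):
    print(m, gauss_moment(m), gamma((m + 1) / 2))

print(sqrt(pi), gamma(0.5))     # Eq. (8): Gamma(1/2) = pi**0.5
print(factorial(2), gamma(3))   # Eq. (11) with l = 2: Gamma(3) = 2!
```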


THE STIRLING APPROXIMATION 


For large values of $n$, $n!$ can be approximated by

$$n! \simeq (2\pi n)^{1/2}\,n^n\exp\left[-n + \frac{1}{12n} + O\!\left(\frac{1}{n^2}\right)\right] , \tag{12a}$$

or

$$\log n! \simeq \tfrac12\log 2\pi + (n + \tfrac12)\log n - n + \frac{1}{12n} + O\!\left(\frac{1}{n^2}\right) . \tag{12b}$$

Here the term $1/12n$ is the first term of an expansion by powers of $1/n$, and
$O(1/n^2)$ stands for omitted higher order terms in this expansion, of order $1/n^2$
or higher. In practice, even the term $1/12n$ is usually omitted. Its principal role
is to check on the accuracy of the approximation. If the effect of the $1/12n$
correction introduces only a change below the desired accuracy, the entire
expression has the desired accuracy.

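To derive (12) we write, in accordance with (11),

$$n! = \int_0^{\infty} x^n e^{-x}\,dx = \int_0^{\infty}\exp[f(x)]\,dx , \tag{13}$$

where

$$f(x) = n\log x - x . \tag{14}$$

We make the substitution

$$x = n + yn^{1/2} = n(1 + yn^{-1/2}) , \qquad dx = n^{1/2}dy . \tag{15}$$

Then

$$f(x) = n\log n - n + g(y) , \tag{16}$$

where

$$g(y) = n[\log(1 + yn^{-1/2}) - yn^{-1/2}] . \tag{17}$$

With these,

$$\exp[f(x)] = n^n e^{-n}\exp[g(y)] , \tag{18}$$

$$n! = n^{1/2}n^n e^{-n}\int_{-n^{1/2}}^{\infty}\exp[g(y)]\,dy . \tag{19}$$

The function $g(y)$ has its maximum at $y = 0$: $g(0) = 0$. Using the Taylor
expansion of the logarithm,

$$\log(1 + s) = s - \tfrac12 s^2 + \tfrac13 s^3 - \tfrac14 s^4 + \cdots , \tag{20}$$

with $s = (y^2/n)^{1/2}$, we expand $g(y)$:

$$g(y) = n\left[-\frac12\,\frac{y^2}{n} + \frac13\left(\frac{y^2}{n}\right)^{3/2} - \cdots\right]
  = -\tfrac12 y^2 + O(s) . \tag{21}$$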



In the limit $n \to \infty$, $s \to 0$, and all but the first term in (21) vanish, and the
integral in (19) becomes

$$\int_{-\infty}^{\infty}\exp[g(y)]\,dy = \int_{-\infty}^{\infty}\exp(-\tfrac12 y^2)\,dy = (2\pi)^{1/2} , \tag{22}$$

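with the aid of (4). If (22) is inserted into (19) the result is identical to (12a) except
for the correction term $1/12n$. Its derivation is a bit tedious. We work with
$\log n!$ and write

$$\log n! = \tfrac12\log 2\pi + (n + \tfrac12)\log n - n + \frac{A}{n} + O\!\left(\frac{1}{n^2}\right) . \tag{23a}$$

If we replace $n$ by $n - 1$,

$$\log(n - 1)! = \tfrac12\log 2\pi + (n - \tfrac12)\log(n - 1) - (n - 1)
  + \frac{A}{n - 1} + O\!\left(\frac{1}{n^2}\right) . \tag{23b}$$

We subtract (23b) from (23a):

$$\log n! - \log(n - 1)! = \log\frac{n!}{(n - 1)!} = \log n
  = (n + \tfrac12)\log n - (n - \tfrac12)\log(n - 1) - 1
  + \frac{A}{n} - \frac{A}{n - 1} + O\!\left(\frac{1}{n^3}\right) , \tag{24}$$

where all omitted terms are now at least of order $1/n^3$. The two terms in $A$
can be combined:

$$\frac{A}{n} - \frac{A}{n - 1} = -\frac{A}{n^2} + O\!\left(\frac{1}{n^3}\right) . \tag{25}$$

If this is inserted into (24), we find

$$\frac{A}{n^2} = (n - \tfrac12)\log\frac{n}{n - 1} - 1 + O\!\left(\frac{1}{n^3}\right) . \tag{26}$$

For large $n$ the logarithm may be expanded according to (20), with $s = -1/n$:

$$\log\frac{n}{n - 1} = -\log\left(1 - \frac1n\right) = \frac1n + \frac{1}{2n^2} + \frac{1}{3n^3} + O\!\left(\frac{1}{n^4}\right) ; \tag{27}$$

$$(n - \tfrac12)\log\frac{n}{n - 1} = 1 + \frac{1}{2n} + \frac{1}{3n^2} - \frac{1}{2n} - \frac{1}{4n^2} + O\!\left(\frac{1}{n^3}\right)
  = 1 + \frac{1}{12n^2} + O\!\left(\frac{1}{n^3}\right) . \tag{28}$$

If this is inserted in (26) we see that $A = 1/12$.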

We are often interested not in $n!$ but only in $\log n!$, and only to an accuracy
such that the relative error between an approximate value of $\log n!$ and the
true value decreases with increasing $n$. Such an approximation is obtained by
neglecting all terms in (12b) that increase less rapidly than linearly with $n$:

$$\log n! \simeq n\log n - n . \tag{29}$$
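
The quality of the two approximations can be inspected directly. The sketch below compares Eq. (12b), including the $1/12n$ term, and the cruder Eq. (29) with $\log n!$ obtained from the log-gamma function; the function names `stirling_full` and `stirling_crude` are labels chosen here for illustration.

```python
from math import lgamma, log, pi

def stirling_full(n):
    """Eq. (12b) including the 1/(12 n) correction term."""
    return 0.5 * log(2 * pi) + (n + 0.5) * log(n) - n + 1.0 / (12 * n)

def stirling_crude(n):
    """Eq. (29): log n! ~ n log n - n."""
    return n * log(n) - n

for n in (5, 10, 100, 1000):
    exact = lgamma(n + 1)   # log n!
    # absolute error of (12b), then relative error of (29)
    print(n, exact, stirling_full(n) - exact, (stirling_crude(n) - exact) / exact)
```

The absolute error of (12b) is already very small at $n = 10$, while the relative error of (29) decreases only slowly with increasing $n$.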


Appendix B 
Temperature Scales 

DEFINITION OF THE KELVIN SCALE


Numerical values of temperature*,† are not expressed in practice in terms of the
fundamental temperature $\tau$, whose unit is the unit of energy, but on the (absolute)
thermodynamic temperature scale $T$, the Kelvin scale, whose unit is the kelvin,
symbol K. The kelvin was defined in 1954 by international agreement as the
fraction 1/273.16 of the temperature $T_t$ of the triple point of pure water. Hence,
by this definition, $T_t = 273.16$ K, exactly. This temperature is 0.01 K above the
atmospheric-pressure freezing point of water (the ice point), $T_0 = 273.15$ K.
The triple point is more easily and accurately reproducible than the ice point.
The triple point establishes itself automatically in any clean evacuated vessel
that is partially backfilled with pure water and cooled until part but not all of
the water is frozen, leading to an equilibrium between solid ice, liquid water,
and the water vapor above the ice-water mixture.

The Celsius temperature scale $t$ is defined in terms of the Kelvin scale, by

$$t = T - 273.15\ \mathrm{K} . \tag{1}$$


Temperatures on this scale are expressed in degrees Celsius, symbol °C. Tem- 
perature differences have the same value on both Kelvin and Celsius scales. 

The conversion factor $k_B$ between the fundamental temperature $\tau$ and the
Kelvin temperature,

$$\tau = k_BT , \tag{2}$$


* We appreciate the assistance of Norman E. Phillips in the preparation of this appendix. 

† The ultimate survey of the state of development of precise temperature measurements is the
proceedings of an international symposium taking place every few years under the title Temperature,
its measurement and control in science and industry. The proceedings are published under the same
title. Vol. 1: C. O. Fairchild, editor; Reinhold, 1940. This volume is largely obsolete. Vol. 2: E. C.
Wolfe, editor; Reinhold, 1955. Although no longer reflecting the state of the art, this volume is still 
useful for its thorough introductory discussions of principles of various methods of temperature 
measurements, not all of which are repeated in the later volumes. Vol. 3 (3 parts): C. M. Herzfeld, 
editor; Reinhold, 1962. Perhaps the most useful volume because of its introductory discussion of 
principles of various methods of temperature measurements. Vol. 4 (3 parts): H. H. Plumb, editor; 
Instrument Society of America, Pittsburgh, 1972. Most useful for representing the state of the art 
of various methods of temperature measurements; contains less introductory review material 
than Vol. 3. 

is called the Boltzmann constant. Its numerical value must be determined
experimentally; the best current value* is

$$k_B = (1.380662 \pm 0.000044)\times10^{-16}\ \mathrm{erg\,K^{-1}} . \tag{3}$$


The value of the Boltzmann constant is determined with the aid of certain
model systems whose structures are sufficiently simple that one can calculate
the energy distribution of the quantum states, and from it the entropy as a
function of the energy, $\sigma = \sigma(U)$. The fundamental temperature as a function
of the energy is $\tau(U) = (\partial\sigma/\partial U)^{-1}$. Examples of model systems used in the
determination of $k_B$ are the following:


(a) Ideal gas. In the limit of low particle concentration all gases behave as
ideal gases, satisfying $pV = Nk_BT$. One obtains $k_B$ by measuring the $pV$ product
of a known amount of gas at a known Kelvin temperature $T$, extrapolated to
vanishing pressure. The determination of the number of particles $N$ invariably
involves the Avogadro constant $N_A$, independently known.


(b) Black body radiation. We can obtain $k_B$ by fitting the measured spectral
distribution of a black body of known Kelvin temperature $T$ to the Planck
radiation law (Chapter 4). Because this law involves $\tau$ through the ratio
$\hbar\omega/\tau = \hbar\omega/k_BT$, this determination requires the independent knowledge of Planck’s
constant.


(c) Spin paramagnetism. In the limit of vanishing interaction the magnetic
moment $M$ of a system of $N$ spins in a magnetic field $B$, at temperature $\tau$, is
given by Eq. (3.46). Various paramagnetic salts, such as cerous magnesium
nitrate (CMN), are good approximations to noninteracting spin systems if the
temperature is not too low. By fitting measured values of $M$ as a function of $B/T$
to (3.46) we can determine the ratio $m/k_B$, where $m$ is the intrinsic magnetic
moment of the electron, known independently. Usually only the low-field
portion is used, in which case the number of spins must also be known, which
involves again $N_A$. Precision results require correction for weak residual spin
interactions, similar to corrections for particle interactions in a gas.

The $k_B$ value given in (3) is a weighted average of several determinations.
With an uncertainty of about 32 parts per million, it is one of the least accurately
known fundamental constants. Most of this uncertainty is due to the difficulty
of the measurements and to the nonideality of the systems used for these
measurements. About 5 parts per million are due to the limited accuracy with
which $h$ and $N_A$ are known.


* E. R. Cohen and B. N. Taylor, J. Phys. Chem. Reference Data 2, No. 4 (1973).




When expressing temperature as conventional Kelvin temperature $T$ rather
than fundamental temperature $\tau$, it is customary to absorb the Boltzmann
constant into the definition of a conventional entropy $S$,

$$S = k_B\sigma . \tag{4}$$

The relation $dQ = \tau\,d\sigma$ between reversible heat transfer and entropy transfer
then becomes

$$dQ = T\,dS . \tag{5}$$


PRIMARY AND SECONDARY THERMOMETERS 


Any accurately measurable physical property $X$ whose value is an accurately
known function of the temperature, $X = X(T)$, may be used as a thermometric
parameter to measure the temperature of the system possessing the property 
X and of any system in thermal equilibrium with it. Used in this way, the 
system with the property X is a thermometer. The principles underlying 
the most commonly used thermometers are listed in Tables B.1 and B.2. The 
thermometers listed in Table B.2 are called secondary thermometers, defined 
as thermometers whose temperature dependence X(T) must be calibrated 
empirically, by comparison with another thermometer whose calibration is 
already known. The calibration of all secondary thermometers must ultimately 
be traceable to a primary thermometer. But once calibrated, secondary ther- 
mometers are easier to use and are more reproducible than the primary ther- 
mometers available at the same temperature. 

Any calculable model system that can be used to determine the value of the
Boltzmann constant $k_B$ can be used as a primary thermometer, and the three
model systems discussed above are the most important primary thermometers
(Table B.1).

The precision and accuracy of thermometers vary greatly. Precision is 
expressed by the variation AT observed when the same temperature is measured 
at different times with the same instrument. Accuracy is expressed by the 
uncertainty AT with which the thermometer reproduces the true Kelvin scale. 
Secondary thermometers based on electrical resistance measurements may 
achieve a precision of 1 part in 10°. The precision of thermometers based on 
mechanical pressure measurements is much poorer, particularly at low pres- 
sures. For example, helium vapor pressure thermometers at the lower end of 
their useful range have a precision of about 1 part in 10°. The accuracy of 
secondary thermometers is limited by the accuracy of the primary thermometers 


Table B.2 Principles of the most important secondary thermometers

Physical property                               Useful range in K

Thermoelectric voltage of thermocouples*            400-1400
Thermal expansion of liquid in glass                200-400
Electrical resistance
    metals*                                         14-700
    semiconductors (germanium)*                     0.05-77
    commercial carbon resistors†                    0.05-20
Vapor pressure of liquefied gas
    ⁴He                                             1-5.2
    ³He                                             0.3-3.2

NOTE: The temperature ranges are approximate ranges of wide utility,
not ultimate limits.

* Used as interpolating instrument in the IPTS.

† Widely used as cryogenic laboratory thermometer; each specimen
must be individually calibrated to deliver usable accuracy.


used to calibrate them. The accuracy of primary thermometers is limited by 
their relatively poor precision and by residual variations between different 
thermometers. As a rough estimate, the present-day accuracy of primary 
thermometers is about 1 part in 10* above 100K, about 1 part in 10° around 
1K, and about 1 part in 10? near 0.01 K. 


THERMODYNAMIC THERMOMETRY 


It is possible to perform primary thermometry without relying on the theoreti- 
cally known properties of simple model systems, by somehow utilizing the 
relation (5). We give three examples. 


(a) Carnot cycle. Consider a Carnot cycle operating between a known
temperature $T_1$ and the unknown temperature $T_2$. Because of entropy conservation
the heat transfers at the two temperatures satisfy $Q_1/T_1 = Q_2/T_2$. The
unknown temperature can be determined by measuring the ratio of the two heat
transfers. The method is not very practical.


(b) Magnetic calorimetry. Suppose a paramagnetic substance is initially at a
known temperature $T_1$, in a magnetic field $B_1$. Let the substance be cooled by
isentropic demagnetization to the unknown temperature $T_2$. If now a known
small amount $dQ$ of heat is added to the substance, its entropy is raised by
$dS = dQ/T_2$. The substance is then isentropically re-magnetized, and the
magnetic field $B_1'$ is determined at which the temperature has returned exactly
to $T_1$. The field $B_1'$ will be found slightly different from $B_1$: $B_1' = B_1 + dB$.
Entropy conservation requires

$$dS = dQ/T_2 = S(T_1,B_1') - S(T_1,B_1) = (\partial S/\partial B)_T\,dB . \tag{6}$$

From the thermodynamic identity for the Helmholtz free energy for a
magnetizable substance,

$$dF = -S\,dT - M\,dB , \tag{7}$$

one obtains, by the usual cross-differentiation, the Maxwell relation

$$(\partial S/\partial B)_T = (\partial M/\partial T)_B . \tag{8}$$

We insert (8) into (6) to find the expression for the unknown temperature:

$$T_2 = (dQ/dB)/(\partial M/\partial T)_B . \tag{9}$$

The quantities $dQ$ at $T = T_2$ and $dB$ at $T = T_1$ are known, and the temperature
derivative of $M$ at $T = T_1$ and $B = B_1$ is easily measured. The method makes
no assumptions about the ideality of the paramagnetic substance, and it has
therefore been used extensively at low temperatures.


(c) Clausius-Clapeyron thermometry. The melting temperature $T_m$ of a
substance varies with pressure $p$ according to the Clausius-Clapeyron equation of
Chapter 10:

$$dT_m/dp = T_m\,\Delta V/\Delta H , \tag{10}$$

where $\Delta V$ is the volume change during melting, and $\Delta H$ the latent heat of
fusion. If both quantities have been measured as functions of pressure, (10) can
be integrated:

$$T_2/T_1 = \exp\int_{p_1}^{p_2}(\Delta V/\Delta H)\,dp . \tag{11}$$


If T, and p, are known, a measurement of the pressure p, at which the unknown 
temperature T, is the equilibrium melting temperature permits calculation of 


International Practical Temperature Scale (IPTS) 


T> from (11). By utilization of the strong temperature dependence of the 
solidification pressure of liquid *He, the method has been used as an alternative 
to magnetic thermometry at low temperatures. 
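
The integration in (11) is usually done numerically on tabulated data. The sketch below applies the trapezoidal rule; the function name `melting_temperature` and the sample numbers are hypothetical, chosen only so that the result can be compared with the closed form for a constant ratio ΔV/ΔH.

```python
import math


def melting_temperature(T1, pressures, dV_over_dH):
    """Integrate eq. (11) by the trapezoidal rule.

    pressures   -- pressures ordered from p1 to p2
    dV_over_dH  -- measured ratio (Delta V)/(Delta H) at each pressure
    Returns T2, the melting temperature at the last pressure.
    """
    integral = 0.0
    for i in range(1, len(pressures)):
        dp = pressures[i] - pressures[i - 1]
        integral += 0.5 * (dV_over_dH[i] + dV_over_dH[i - 1]) * dp
    return T1 * math.exp(integral)


# Hypothetical data: constant ratio k, for which (11) gives
# T2 = T1 * exp(k * (p2 - p1)).
k = 2.0e-9                                   # made-up value, in 1/Pa
p = [1.0e5 + i * 1.0e6 for i in range(11)]   # pressures in Pa
T2 = melting_temperature(273.0, p, [k] * len(p))
print(T2)
```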


INTERNATIONAL PRACTICAL TEMPERATURE 
SCALE (IPTS) 


Many known phase equilibrium temperatures can be reproduced far more 
precisely than the accuracy with which their exact location on the Kelvin scale 
can be determined by primary thermometry. To facilitate practical thermometry, 
a number of easily reproducible phase equilibrium temperatures have been 
determined as accurately as possible and have been assigned best values to 
define an International Practical Temperature Scale (IPTS). On the IPTS the 
selected equilibrium points are treated as if their temperatures were known to be 
exactly equal to their assigned values. Intermediate temperatures are determined 
by a precisely specified interpolation procedure that is chosen to reproduce the 
true Kelvin scale as accurately as possible. The present version of the scale is 
IPTS68, adopted in 1968 by international agreement, covering temperatures 
from the triple point of hydrogen (13.81 K) upward.* Table B.3 gives the 
assigned temperatures for IPTS68. 

In the range between 13.81 K and 903.89 K, which is the melting point of 
antimony, a platinum resistance thermometer is used as the interpolating 
instrument. In the range from 903.89 K to 1337.58 K, the melting point of gold, 
a platinum-platinum/rhodium thermocouple is used. Above 1337.58 K black 
body radiation is used. 
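
The division of IPTS68 into ranges can be summarized as a small lookup. This is only a sketch of the text above: the function name is hypothetical, and assigning each boundary temperature to the lower range is an assumption made here for definiteness.

```python
def ipts68_interpolating_instrument(T_kelvin):
    """Return the IPTS68 interpolating instrument for a temperature in K."""
    if T_kelvin < 13.81:
        return None  # IPTS68 starts at the triple point of hydrogen
    if T_kelvin <= 903.89:
        return "platinum resistance thermometer"
    if T_kelvin <= 1337.58:
        return "platinum-platinum/rhodium thermocouple"
    return "black body radiation"


for T in (4.2, 77.0, 1000.0, 2000.0):
    print(T, ipts68_interpolating_instrument(T))
```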

Below 13.81 K no precisely defined procedure has been agreed upon. In the range 
between 5.2 K and 13.81 K various scales based on the vapor pressure of 
hydrogen are in practical use. Below 5.21 K, the critical point of ⁴He, down to 
about 0.3 K, the 1958 and 1962 helium scales* are widely used as de facto 
extensions of IPTS68. The 1958 ⁴He scale relates the vapor pressure of ⁴He to 
the temperature T; the 1962 ³He scale uses the vapor pressure of ³He. 

As the accuracy of primary temperature measurements improves, errors in 
practical scales such as IPTS are uncovered, leading eventually to revision 
of the practical scales. Table B.3 lists some errors now believed to exist in 
IPTS68. 


* See, for example, American Institute of Physics handbook, 3rd ed., McGraw-Hill, 1972; Section 4: 
Heat, M. W. Zemansky, editor. Contains complete original references. 
Table B.3 Assigned temperatures of the International 
Practical Temperature Scale of 1968 


Equilibrium point 
Substance    Type            T_68, in K    T_68 − T, in K 
hydrogen     t               13.81 
hydrogen     b (250 torr)    17.042 
hydrogen     b               20.28 
neon         b               27.402 
oxygen       t               54.361 
oxygen       b               90.188 
water        t               273.16        exact 
water        b               373.15        0.025 
tin          f               505.1181      0.044 
zinc         f               692.43        0.066 
silver       f               1235.08 
gold         f               1337.58 


Note: Except for the triple points and the 17.042 K point, all 
equilibria are those at a pressure of one standard atmosphere, 
p_0 = 101,325 N m^-2 (= 760 torr). The 17.042 K point is the 
boiling point of hydrogen at 250 torr. The notations t, b, and 
f in the second column refer to triple points, boiling points, 
and freezing points.* The last column contains estimates of 
errors known to exist, from Physics Today 29, No. 12, p. 19 
(Dec. 1976). 

* All data from the American Institute of Physics handbook, 
3rd ed., McGraw-Hill, 1972; Section 4: Heat, M. W. Zemansky, 
editor. 


Appendix C 
Poisson Distribution 


The Poisson distribution law is a famous result of probability theory. The result 
is useful in the design and analysis of counting experiments in physics, biology, 
operations research, and engineering. The statistical methods we have developed 
lend themselves to an elegant derivation of the Poisson law, which is concerned 
with the occurrence of small numbers of objects in random sampling processes. 
It is also called the law of small numbers. If on the average there is one bad 
penny in a thousand, what is the probability that N bad pennies will be found in 
a given sample of one hundred pennies? The problem was first considered and 
solved in a remarkable study of the role of luck in criminal and civil law trials 
in France in the early nineteenth century. 

We derive the Poisson distribution law with the aid of a model system that 
consists of a large number R of independent lattice sites in thermal and diffusive 
contact with a gas. The gas serves as a reservoir. Each lattice site may adsorb 
zero or one atom. We want to find the probabilities 


P(0), P(1), P(2), ..., P(N), ..., 


that a total of 0, 1, 2, ..., N, ... atoms are adsorbed on the R sites, if we are 
given the average number <N> of adsorbed atoms over an ensemble of similar 
systems. 

Consider a system composed of a single site. It is convenient to set the binding 
energy of an atom to the site as zero. The identical form for the distribution is 
found if a binding energy is included in the calculation. The Gibbs sum is 


$$ \mathcal{Z} = 1 + \lambda , \tag{1} $$


where the term λ is proportional to the probability the site is occupied, and 
the term 1 is proportional to the probability the site is vacant. Thus the absolute 
probability that the site is occupied is 


$$ f = \frac{\lambda}{1 + \lambda} . \tag{2} $$


The actual value of λ is determined by the condition of the gas in the reservoir, 
because for diffusive contact between the lattice and the reservoir we must have 

$$ \lambda(\text{lattice}) = \lambda(\text{gas}) , \tag{3} $$


by the argument of Chapter 5. The evaluation of λ(gas) for an ideal gas was 
given in Chapter 6. 

We now extend the treatment to R independent sites. Then 


$$ \mathcal{Z}_{\rm tot} = \mathcal{Z}_1 \mathcal{Z}_2 \cdots \mathcal{Z}_R = (1 + \lambda)^R . \tag{4} $$


By the argument used in Chapter 1 we know that the binomial expansion of 
(○ + ●)^R or (1 + λ)^R counts once and only once every state of the system of 
R sites. Each site has two alternative states, namely ○ for vacant or ● for 
occupied, which correspond in the Gibbs sum to the term 1 for λ^0 and the 
term λ for λ^1. 

In the low-occupancy limit of f ≪ 1 we have f ≈ λ, whence 


$$ \langle N \rangle = fR = \lambda R \tag{5} $$


is the average total number of adsorbed atoms. The Poisson distribution is 
concerned with this low-occupancy limit. We can now write (4) as 


$$ \mathcal{Z}_{\rm tot} = \left(1 + \frac{\lambda R}{R}\right)^R = \left(1 + \frac{\langle N \rangle}{R}\right)^R . \tag{6} $$


Next we let the number of sites R increase without limit, while holding the 
average number of occupied sites <N> constant. The Poisson distribution is 
concerned with infrequent events! By the definition of the exponential function 
we have 


$$ \lim_{R \to \infty} \left(1 + \frac{\langle N \rangle}{R}\right)^R = \exp\langle N \rangle , \tag{7} $$

so that 

$$ \mathcal{Z}_{\rm tot} = \exp\langle N \rangle = \exp(\lambda R) = \sum_N \frac{(\lambda R)^N}{N!} . \tag{8} $$


The last step here is the expansion of the exponential function in a power series. 
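
The approach to the limit in (7) and the series in (8) can be checked numerically. The following sketch uses hypothetical helper names and an arbitrary value <N> = 2; it is meant only to illustrate how quickly the finite-R Gibbs sum approaches the exponential.

```python
import math


def gibbs_sum_finite(mean_n, R):
    """Gibbs sum (1 + <N>/R)**R for R sites at fixed <N>, as in eq. (6)."""
    return (1.0 + mean_n / R) ** R


def exp_series(x, terms=60):
    """Partial sum of the power series sum_N x**N / N! used in eq. (8)."""
    total, term = 0.0, 1.0
    for n in range(terms):
        total += term
        term *= x / (n + 1)
    return total


mean_n = 2.0
for R in (10, 100, 10_000, 1_000_000):
    print(R, gibbs_sum_finite(mean_n, R))
print("exp:", math.exp(mean_n), "series:", exp_series(mean_n))
```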


The term in λ^N in $\mathcal{Z}_{\rm tot}$ is proportional to the probability P(N) that N sites 
are occupied. With the Gibbs sum as the normalization factor we have in the 
limit of large R: 

$$ P(N) = \frac{\lambda^N R^N}{N!\,\mathcal{Z}_{\rm tot}} = \frac{\lambda^N R^N \exp(-\lambda R)}{N!} , \tag{9} $$

or, because λR = <N> from (5), 

$$ P(N) = \frac{\langle N \rangle^N \exp(-\langle N \rangle)}{N!} . \tag{10} $$

This is the Poisson distribution law. 
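
As an illustration, (10) can be applied to the bad-penny question posed at the start of this appendix: one bad penny in a thousand and a sample of one hundred give <N> = 0.1. The sketch below compares the Poisson value with the exact binomial probability, under the assumption that each penny in the sample is bad independently with probability 1/1000; the function names are hypothetical.

```python
import math


def poisson(n, mean_n):
    """Poisson law, eq. (10): <N>**n * exp(-<N>) / n!."""
    return mean_n**n * math.exp(-mean_n) / math.factorial(n)


def binomial(n, trials, p):
    """Exact binomial probability of n successes in `trials` tries."""
    return math.comb(trials, n) * p**n * (1.0 - p) ** (trials - n)


# Bad pennies: one in a thousand is bad, sample of one hundred, so <N> = 0.1.
for n in range(4):
    print(n, poisson(n, 0.1), binomial(n, 100, 0.001))
```

The two columns agree closely because the occupancy per "site" (here, per penny) is small, which is the limit in which the Poisson law was derived.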
Particular interest attaches to the probability P(0) that none of the sites 
is occupied. From (10) we find, with <N>^0 = 1 and 0! = 1, 

$$ P(0) = \exp(-\langle N \rangle); \qquad \log P(0) = -\langle N \rangle . \tag{11} $$

Thus the probability of zero occupancy is simply related to the average number 
<N> of occupied sites. This suggests a simple experimental procedure for the 
determination of <N>: just count the systems that have no adsorbed atoms. 
Values of P(N) for several values of <N> are given in Table C.1. Plots are 
given in Figure C.1 for <N> = 0.5, 1, 2, and 3. 
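
The counting procedure suggested by (11) can be tried on simulated data. In this sketch every name and number is hypothetical: each of 5000 systems has R = 200 sites, each occupied independently with probability f = 0.005, so that <N> = 1. The estimate from the empty fraction is compared with the directly computed mean.

```python
import math
import random


def estimate_mean_from_empty_fraction(counts):
    """Estimate <N> from the fraction of systems with zero atoms, eq. (11)."""
    empty = sum(1 for c in counts if c == 0)
    return -math.log(empty / len(counts))


def simulate_counts(n_systems, R, f, rng):
    """Occupy each of R sites independently with probability f in every system."""
    return [sum(1 for _ in range(R) if rng.random() < f) for _ in range(n_systems)]


rng = random.Random(0)  # fixed seed so the run is repeatable
counts = simulate_counts(n_systems=5000, R=200, f=0.005, rng=rng)
print(estimate_mean_from_empty_fraction(counts), sum(counts) / len(counts))
```

The estimate fails if no system in the sample is empty, so the procedure is practical only when <N> is not large.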
Table C.1 Values of the Poisson distribution function P(N) = <N>^N exp(−<N>)/N! 

<N>     0.1     0.3     0.5     0.7     0.9     1       2       3       4       5 
P(0)    0.9048  0.7408  0.6065  0.4966  0.4066  0.3679  0.1353  0.0498  0.0183  0.0067 
P(1)    0.0905  0.2222  0.3033  0.3476  0.3659  0.3679  0.2707  0.1494  0.0733  0.0337 
P(2)    0.0045  0.0333  0.0758  0.1217  0.1647  0.1839  0.2707  0.2240  0.1465  0.0842 
P(3)    0.0002  0.0033  0.0126  0.0284  0.0494  0.0613  0.1804  0.2240  0.1954  0.1404 
P(4)            0.0003  0.0016  0.0050  0.0111  0.0153  0.0902  0.1680  0.1954  0.1755 
P(5)                    0.0002  0.0007  0.0020  0.0031  0.0361  0.1008  0.1563  0.1755 
P(6)                            0.0001  0.0003  0.0005  0.0120  0.0504  0.1042  0.1462 
P(7)                                            0.0001  0.0034  0.0216  0.0595  0.1044 
P(8)                                                    0.0009  0.0081  0.0298  0.0653 
P(9)                                                    0.0002  0.0027  0.0132  0.0363 
P(10)                                                           0.0008  0.0053  0.0181 

Figure C.1 Poisson distribution, P versus N, for several values of <N>. 


Example: Incorrect and correct counting of states. (a) The Gibbs sum for the R sites is not 

$$ \mathcal{Z}_{\rm tot} = 1 + \lambda + \lambda^2 + \cdots + \lambda^R . \tag{12} $$


Why not? 
(b) The correct sum is 


$$ \mathcal{Z}_{\rm tot} = (1 + \lambda)^R = 1 + R\lambda + \tfrac{1}{2}R(R-1)\lambda^2 + \cdots + \lambda^R = \sum_N g(R,N)\,\lambda^N , \tag{13} $$

where 

$$ g(R,N) = \frac{R!}{(R-N)!\,N!} $$


is the binomial coefficient. Note that g(R,N) is the number of independent states of the 
system for a given number of adsorbed atoms N. The Gibbs sum is a sum over all states. 
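
The difference between the two sums can be made concrete by brute-force enumeration for a small system. The sketch below, with hypothetical names and with R = 6 and λ = 0.3 chosen arbitrarily, lists all 2^R configurations, adds λ^N for each one, and counts how many configurations share each N.

```python
import math
from itertools import product


def gibbs_sum_by_enumeration(R, lam):
    """Sum lam**N over all 2**R configurations of R two-state sites."""
    total = 0.0
    states_with_n = [0] * (R + 1)
    for config in product((0, 1), repeat=R):  # 0 = vacant, 1 = occupied
        n = sum(config)
        states_with_n[n] += 1
        total += lam**n
    return total, states_with_n


R, lam = 6, 0.3
total, states_with_n = gibbs_sum_by_enumeration(R, lam)
wrong = sum(lam**n for n in range(R + 1))  # the incorrect sum (12)
print(total, (1 + lam) ** R, wrong)
print(states_with_n, [math.comb(R, n) for n in range(R + 1)])
```

The enumerated total matches (1 + λ)^R, and the multiplicities match the binomial coefficients g(R,N); the sum (12) is too small because it counts only one state for each value of N.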




Example: Elementary derivation of P(0). Let a total of R bacteria be distributed at 
random among L dishes. Each dish is viewed as a system of many sites to which a bacterium 
may attach. The L dishes represent an ensemble of L identical systems. The average number 
of bacteria per dish is 


$$ \langle N \rangle = R/L . \tag{14} $$


Each time a bacterium is distributed, the probability that a given dish will receive that 
bacterium is 1/L. The probability the given dish will not receive the bacterium is 


$$ \left(1 - \frac{1}{L}\right) . \tag{15} $$


The probability in R tries that the given dish will receive no bacteria is 


$$ P(0) = \left(1 - \frac{1}{L}\right)^R , \tag{16} $$


because the factor (15) enters on each try. 
We may write (16) as 


$$ P(0) = \left(1 - \frac{\langle N \rangle}{R}\right)^R , \tag{17} $$

by use of <N> = R/L. We know that in the limit of large R, 

$$ \exp(-\langle N \rangle) = \lim_{R \to \infty} \left(1 - \frac{\langle N \rangle}{R}\right)^R , \tag{18} $$


by the definition of the exponential function. Thus for R ≫ 1 and L ≫ 1 we have 

$$ P(0) = \exp(-\langle N \rangle) , \tag{19} $$


in agreement with (11). 
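
The argument can be checked both exactly and by simulation. In the sketch below the function names and the numbers R = 20,000 and L = 10,000 are hypothetical; the exact expression (16), the limiting form (19), and the empty fraction found in one random distribution of the bacteria are printed side by side.

```python
import math
import random


def p0_exact(R, L):
    """Eq. (16): probability that a given dish receives none of R bacteria."""
    return (1.0 - 1.0 / L) ** R


def p0_simulated(R, L, rng):
    """Drop R bacteria at random into L dishes; return the fraction left empty."""
    counts = [0] * L
    for _ in range(R):
        counts[rng.randrange(L)] += 1
    return counts.count(0) / L


R, L = 20_000, 10_000  # hypothetical numbers, <N> = R/L = 2
rng = random.Random(1)  # fixed seed so the run is repeatable
print(p0_exact(R, L), math.exp(-R / L), p0_simulated(R, L, rng))
```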




PROBLEMS 


1. Random pulses. A radioactive source emits alpha particles which are 
counted at an average rate of one per second. (a) What is the probability of 
counting exactly 10 alpha particles in 5 s? (b) Of counting 2 in 1 s? (c) Of counting 
none in 5s? The answers to (a) and (b) are not identical. 
2. Approach to Gaussian distribution. Show that the Poisson function P(N) = 
<N>^N exp(−<N>)/N! closely approaches a Gaussian function in form, for 
large <N>. That is, show when N is close to <N> that 


$$ P(N) \approx A \exp[-B(N - \langle N \rangle)^2] , $$


where A, B are quantities to be determined by you. Hint: Work with log P(N); 
use the Stirling approximation. In the Gaussian form both A and B are functions 
of <N>; in the development of the Poisson function you may find A, B are 
functions of N, but the two forms of A, B are closely equivalent over the range in 
which the exponential factor has significant values. 


Appendix D 


Pressure 


Let a pressure p_s be applied normal to the faces of a cube filled with a gas or 
liquid in quantum state s. By elementary mechanics (Chapter 3) the pressure 
is equal to 


$$ p_s = -dU_s/dV , \tag{1} $$


where U_s is the energy of the system in the state s. We can also write the pressure 
as 


$$ p_s = -(dU/dV)_s , \tag{2} $$


where (dU/dV)_s denotes the expectation value* of dU/dV over the state s at 
volume V. It is important that we can calculate p by (2), which is at a fixed 
volume with no ambiguity about the identity of the selected state s, whereas 
(1) involves following the state through two volumes, V and V + dV, with some 
possible doubt whether the state remains the same. The ensemble average 
pressure p is the average of p_s over the states represented in the ensemble: 


$$ p = \langle p_s \rangle = -\langle (dU/dV)_s \rangle . \tag{3} $$


Because the number of states in the ensemble is constant, the entropy is constant, 
so that the derivative is at constant entropy. We may therefore write 


$$ p = -\left(\frac{\partial U}{\partial V}\right)_\sigma . \tag{4} $$


The result (4) uses the energy of the system expressed as U(σ,V,...); that is, 
as a function of the volume V and the entropy σ, not the temperature τ. It is 
the entropy and not the temperature that is to be held constant in the 
differentiation. 
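
A simple case in which (1) can be evaluated directly is a single particle in a cubical box, an assumed model in which the energy of a state with fixed quantum numbers varies as V^(-2/3). Differentiating at fixed quantum numbers is the single-state analogue of holding the entropy constant. The sketch below uses hypothetical names and arbitrary units and compares a numerical derivative with the closed form p_s = 2U_s/3V that follows from the V^(-2/3) dependence.

```python
def orbital_energy(V, n_squared=3, prefactor=1.0):
    """Energy of one particle-in-a-box state at volume V.

    U_s = prefactor * n_squared / V**(2/3), where prefactor stands for
    hbar**2 * pi**2 / (2*m) in arbitrary units (assumed value 1) and
    n_squared = nx**2 + ny**2 + nz**2 labels the state.
    """
    return prefactor * n_squared / V ** (2.0 / 3.0)


def pressure(V, dV=1.0e-6, **state):
    """p_s = -dU_s/dV by central difference at fixed quantum numbers."""
    upper = orbital_energy(V + dV, **state)
    lower = orbital_energy(V - dV, **state)
    return -(upper - lower) / (2.0 * dV)


V = 2.0
print(pressure(V), 2.0 * orbital_energy(V) / (3.0 * V))
```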


* The equivalence of (1) and (2) is an example of the Hellmann-Feynman theorem of quantum 
mechanics, according to which the derivatives of the hamiltonian ℋ and energy eigenvalue U with 
respect to a parameter λ are related by dU/dλ = <s|dℋ/dλ|s>. The derivation may be found on 
p. 1192 of C. Cohen-Tannoudji, B. Diu, and F. Laloé, Quantum mechanics, Wiley, 1977; see also 
E. Merzbacher, Quantum mechanics, 2nd ed., Wiley, 1970, p. 442. 
Appendix E 


Negative Temperature 


The result of Problem 2.2 for the entropy of a spin system as a function of the 
energy in a magnetic field is plotted here in Figure E.1. Notice the region in 
which ∂σ/∂U is negative (Figure E.2). Negative τ means that the population 
of the upper state is greater than the population of the lower state. When this 
condition obtains we say that the population is inverted, as illustrated in 
Figure E.3. 
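
The sign change of the temperature can be seen in a few lines of code. The sketch below does not reproduce Problem 2.2; it assumes the two state model of Figure E.1, with N systems whose levels are separated by ε, so that the entropy is the logarithm of a binomial coefficient. The function names are hypothetical.

```python
import math


def entropy(n_upper, N):
    """sigma = log of the number of states with n_upper systems excited."""
    return (math.lgamma(N + 1) - math.lgamma(n_upper + 1)
            - math.lgamma(N - n_upper + 1))


def temperature(n_upper, N, eps=1.0):
    """tau = 1/(d sigma/dU), by a centered difference with U = n_upper*eps."""
    dsigma_dU = (entropy(n_upper + 1, N) - entropy(n_upper - 1, N)) / (2.0 * eps)
    return 1.0 / dsigma_dU


N = 1000
for n_upper in (100, 400, 600, 900):
    print(n_upper, temperature(n_upper, N))
```

The temperature is positive while fewer than half the systems are excited and negative once the population is inverted. The function is not meant to be called at exactly half filling, where the slope of the entropy vanishes.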

The concept of negative temperature is physically meaningful for a system 
that satisfies the following restrictions: (a) There must be a finite upper limit to 
the spectrum of energy states, for otherwise a system at a negative temperature 
would have an infinite energy. A freely moving particle or a harmonic oscillator 
cannot have negative temperatures, for there is no upper bound on their energies. 
Thus only certain degrees of freedom of a particle can be at a negative 
temperature: the nuclear spin orientation in a magnetic field is the degree of freedom 
most commonly considered in experiments at negative temperatures. (b) The 
system must be in internal thermal equilibrium. This means the states must have 
occupancies in accord with the Boltzmann factor taken for the appropriate 
negative temperature. (c) The states that are at a negative temperature must be 
isolated and inaccessible to those states of the body that are at a positive 
temperature. 

Figure E.1 Entropy as function of energy for a two state system. 
The separation of the states is ε = 1 in this example. On the 
left-hand side of the figure ∂σ/∂U is positive, so that τ is positive. 
On the right-hand side ∂σ/∂U is negative and τ is negative. 

Figure E.2 Temperature versus energy for the two 
state system. Here 


Notice that the energy is not a maximum at τ = +∞, 
but is a maximum at τ = −0. 

The ordinary translational and vibrational degrees of freedom of a body 
have an entropy that increases without limit as the energy increases, in contrast 
to the two state or spin system of Figure E.1. If σ increases without limit, then τ 
is always positive. The exchange of energy between a system at a negative 
temperature and a system that can only have a positive temperature (because of 
an unbounded spectrum) will lead always to an equilibrium configuration in 
which both systems are at a positive temperature. 

Figure E.3 Possible spin distributions for various positive and 
negative temperatures. The magnetic field is directed upward. The 
negative spin temperatures cannot last indefinitely because of weak 
coupling between spins and the lattice. The lattice can only be at a 
positive temperature because its energy level spectrum is unbounded 
on top. The downward-directed spins, as at a negative temperature, turn over one 
by one, thereby releasing energy to the lattice and approaching 
equilibrium with the lattice at a common positive temperature. A 
nuclear spin system at negative temperature may relax quite slowly, 
over a time of minutes or hours; during this time experiments at 
negative temperatures may be carried out. 

Negative temperatures correspond to higher energies than positive tempera- 
tures. When a system at a negative temperature is brought into contact with a 
system at a positive temperature, energy will be transferred from the negative 
temperature to the positive temperature. Negative temperatures are hotter 
than positive temperatures. 

The temperature scale from cold to hot runs +0 K, ..., +300 K, ..., +∞ K, 
−∞ K, ..., −300 K, ..., −0 K. Note that if a system at −300 K is brought 
into thermal contact with an identical system at 300 K, the final equilibrium 
temperature is not 0 K, but is ±∞ K. 
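
The last statement can be verified for a pair of identical two state systems, an assumed model in which the fraction x of systems in the upper level obeys x/(1 − x) = exp(−θ/T). Here θ is the level separation divided by the Boltzmann constant; its value of 100 K and the function names are hypothetical.

```python
import math


def upper_fraction(T, theta=100.0):
    """Fraction of two-state systems in the upper level at temperature T (K).

    theta is the level separation divided by k_B, in K (assumed value).
    """
    return 1.0 / (math.exp(theta / T) + 1.0)


def inverse_temperature(x, theta=100.0):
    """1/T in 1/K for upper-level fraction x, from x/(1 - x) = exp(-theta/T)."""
    return math.log((1.0 - x) / x) / theta


x_hot = upper_fraction(-300.0)    # population inverted, x > 1/2
x_cold = upper_fraction(+300.0)   # normal population, x < 1/2
x_final = 0.5 * (x_hot + x_cold)  # identical systems share the energy equally
print(x_cold, x_hot, x_final, inverse_temperature(x_final))
```

Energy conservation leaves each system with exactly half of its members excited, for which 1/T = 0; that is, the common final temperature is infinite rather than zero.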

Nuclear and electron spin systems can be promoted to negative temperatures 
by suitable radio frequency techniques. If a spin resonance experiment is 
carried out on a spin system at negative temperature, resonant emission of 
energy is obtained instead of resonant absorption.* A negative temperature 
system is useful as an rf amplifier in radio astronomy where weak signals must 
be amplified. 

Abragam and Proctor† have carried out an elegant series of experiments on 
calorimetry with systems at negative temperatures. Working with a LiF crystal, 
they established one temperature in the system of Li nuclear spins and another 
temperature in the system of F nuclear spins. In a strong static magnetic field 
the two thermal systems are essentially isolated, but in the Earth’s magnetic 
field the energy levels overlap and the two systems rapidly approach equilibrium 
among themselves (mixing). It is possible to determine the temperature of the 
systems before and after the systems are allowed to mix. Abragam and Proctor 
found that if both systems were initially at positive temperatures they attained 
a common positive temperature on being brought into thermal contact. If both 
systems were prepared initially at negative temperatures, they attained a 
common negative temperature on being brought into thermal contact. If 
prepared one at a positive temperature and the other at a negative temperature, 
then an intermediate temperature was attained on mixing, warmer than the 
initial positive temperature and cooler than the initial negative temperature. 


FURTHER REFERENCES 
ON NEGATIVE TEMPERATURE 


N. F. Ramsey, “Thermodynamics and statistical mechanics at negative absolute 
temperature,” Physical Review 103, 20 (1956). 
M. J. Klein, “Negative absolute temperature,” Physical Review 104, 589 (1956). 


* E. M. Purcell and R. V. Pound, Physical Review 81, 279 (1951). 